Dataset statistics
| Number of variables | 34 |
|---|---|
| Number of observations | 610895 |
| Missing cells | 3727223 |
| Missing cells (%) | 17.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 730.6 MiB |
| Average record size in memory | 1.2 KiB |
Variable types
| Categorical | 19 |
|---|---|
| Numeric | 13 |
| Boolean | 1 |
| Unsupported | 1 |
Filed Online has constant value "True" | Constant |
ESNCAG - Boundary File has constant value "1.0" | Constant |
Central Market/Tenderloin Boundary Polygon - Updated has constant value "1.0" | Constant |
Civic Center Harm Reduction Project Boundary has constant value "1.0" | Constant |
Incident Datetime has a high cardinality: 291613 distinct values | High cardinality |
Incident Date has a high cardinality: 1750 distinct values | High cardinality |
Incident Time has a high cardinality: 1440 distinct values | High cardinality |
Report Datetime has a high cardinality: 438062 distinct values | High cardinality |
Incident Subcategory has a high cardinality: 71 distinct values | High cardinality |
Incident Description has a high cardinality: 829 distinct values | High cardinality |
Intersection has a high cardinality: 6373 distinct values | High cardinality |
Point has a high cardinality: 6460 distinct values | High cardinality |
Incident Year is highly overall correlated with Row ID and 3 other fields | High correlation |
Row ID is highly overall correlated with Incident Year and 3 other fields | High correlation |
Incident ID is highly overall correlated with Incident Year and 3 other fields | High correlation |
Incident Number is highly overall correlated with Incident Year and 3 other fields | High correlation |
CAD Number is highly overall correlated with Incident Year and 3 other fields | High correlation |
Incident Code is highly overall correlated with Incident Category and 1 other fields | High correlation |
CNN is highly overall correlated with Supervisor District and 1 other fields | High correlation |
Supervisor District is highly overall correlated with CNN and 4 other fields | High correlation |
Latitude is highly overall correlated with Supervisor District and 2 other fields | High correlation |
Longitude is highly overall correlated with Current Police Districts and 2 other fields | High correlation |
Neighborhoods is highly overall correlated with Police District and 2 other fields | High correlation |
Current Supervisor Districts is highly overall correlated with Police District and 2 other fields | High correlation |
Current Police Districts is highly overall correlated with Longitude and 3 other fields | High correlation |
Report Type Code is highly overall correlated with Report Type Description and 2 other fields | High correlation |
Report Type Description is highly overall correlated with Report Type Code and 2 other fields | High correlation |
Incident Category is highly overall correlated with Incident Code and 3 other fields | High correlation |
Incident Subcategory is highly overall correlated with Incident Code and 3 other fields | High correlation |
Police District is highly overall correlated with Supervisor District and 5 other fields | High correlation |
Analysis Neighborhood is highly overall correlated with CNN and 8 other fields | High correlation |
HSOC Zones as of 2018-06-05 is highly overall correlated with Supervisor District and 7 other fields | High correlation |
Resolution is highly imbalanced (60.8%) | Imbalance |
CAD Number has 137235 (22.5%) missing values | Missing |
Filed Online has 486720 (79.7%) missing values | Missing |
Intersection has 32624 (5.3%) missing values | Missing |
CNN has 32624 (5.3%) missing values | Missing |
Analysis Neighborhood has 32738 (5.4%) missing values | Missing |
Supervisor District has 32624 (5.3%) missing values | Missing |
Latitude has 32624 (5.3%) missing values | Missing |
Longitude has 32624 (5.3%) missing values | Missing |
Point has 32624 (5.3%) missing values | Missing |
Neighborhoods has 45029 (7.4%) missing values | Missing |
ESNCAG - Boundary File has 604133 (98.9%) missing values | Missing |
Central Market/Tenderloin Boundary Polygon - Updated has 532645 (87.2%) missing values | Missing |
Civic Center Harm Reduction Project Boundary has 532929 (87.2%) missing values | Missing |
HSOC Zones as of 2018-06-05 has 482105 (78.9%) missing values | Missing |
Invest In Neighborhoods (IIN) Areas has 610895 (100.0%) missing values | Missing |
Current Supervisor Districts has 32728 (5.4%) missing values | Missing |
Current Police Districts has 33332 (5.5%) missing values | Missing |
CAD Number is highly skewed (γ1 = 21.75604076) | Skewed |
Report Datetime is uniformly distributed | Uniform |
Invest In Neighborhoods (IIN) Areas is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2023-04-20 08:17:46.703200 |
|---|---|
| Analysis finished | 2023-04-20 08:19:34.841548 |
| Duration | 1 minute and 48.14 seconds |
| Download configuration | config.json |
Incident Datetime
Categorical
| Distinct | 291613 |
|---|---|
| Distinct (%) | 47.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.5 MiB |
| 23-11-2021 13:00 | 96 |
|---|---|
| 01-01-2018 00:00 | 74 |
| 01-01-2019 00:00 | 67 |
| 19-04-2022 03:30 | 60 |
| 01-01-2021 00:00 | 53 |
| Other values (291608) |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 16 |
| Min length | 16 |
Characters and Unicode
| Total characters | 9774320 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 168278 ? |
|---|---|
| Unique (%) | 27.5% |
Sample
| 1st row | 25-07-2021 00:00 |
|---|---|
| 2nd row | 28-06-2022 23:58 |
| 3rd row | 11-03-2022 10:30 |
| 4th row | 15-05-2021 17:47 |
| 5th row | 28-06-2022 17:22 |
Common Values
| Value | Count | Frequency (%) |
| 23-11-2021 13:00 | 96 | < 0.1% |
| 01-01-2018 00:00 | 74 | < 0.1% |
| 01-01-2019 00:00 | 67 | < 0.1% |
| 19-04-2022 03:30 | 60 | < 0.1% |
| 01-01-2021 00:00 | 53 | < 0.1% |
| 01-01-2020 00:00 | 52 | < 0.1% |
| 01-02-2018 00:00 | 48 | < 0.1% |
| 01-08-2018 00:00 | 47 | < 0.1% |
| 01-04-2019 00:00 | 47 | < 0.1% |
| 01-01-2022 00:00 | 46 | < 0.1% |
| Other values (291603) | 610305 |
Length
| Value | Count | Frequency (%) |
| 00:00 | 17615 | 1.4% |
| 12:00 | 16484 | 1.3% |
| 18:00 | 12923 | 1.1% |
| 17:00 | 11808 | 1.0% |
| 20:00 | 11634 | 1.0% |
| 19:00 | 11349 | 0.9% |
| 15:00 | 9991 | 0.8% |
| 21:00 | 9898 | 0.8% |
| 16:00 | 9775 | 0.8% |
| 22:00 | 9708 | 0.8% |
| Other values (3180) | 1100605 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2315116 | |
| 2 | 1630849 | |
| 1 | 1437821 | |
| - | 1221790 | |
| 610895 | 6.2% | |
| : | 610895 | 6.2% |
| 8 | 352716 | 3.6% |
| 3 | 349986 | 3.6% |
| 9 | 331938 | 3.4% |
| 5 | 292612 | 3.0% |
| Other values (3) | 619702 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7330740 | |
| Dash Punctuation | 1221790 | 12.5% |
| Space Separator | 610895 | 6.2% |
| Other Punctuation | 610895 | 6.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2315116 | |
| 2 | 1630849 | |
| 1 | 1437821 | |
| 8 | 352716 | 4.8% |
| 3 | 349986 | 4.8% |
| 9 | 331938 | 4.5% |
| 5 | 292612 | 4.0% |
| 4 | 247672 | 3.4% |
| 7 | 191734 | 2.6% |
| 6 | 180296 | 2.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1221790 |
Space Separator
| Value | Count | Frequency (%) |
| 610895 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 610895 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9774320 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2315116 | |
| 2 | 1630849 | |
| 1 | 1437821 | |
| - | 1221790 | |
| 610895 | 6.2% | |
| : | 610895 | 6.2% |
| 8 | 352716 | 3.6% |
| 3 | 349986 | 3.6% |
| 9 | 331938 | 3.4% |
| 5 | 292612 | 3.0% |
| Other values (3) | 619702 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9774320 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2315116 | |
| 2 | 1630849 | |
| 1 | 1437821 | |
| - | 1221790 | |
| 610895 | 6.2% | |
| : | 610895 | 6.2% |
| 8 | 352716 | 3.6% |
| 3 | 349986 | 3.6% |
| 9 | 331938 | 3.4% |
| 5 | 292612 | 3.0% |
| Other values (3) | 619702 | 6.3% |
Incident Date
Categorical
| Distinct | 1750 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.0 MiB |
| 26-06-2022 | 598 |
|---|---|
| 30-06-2019 | 578 |
| 01-08-2018 | 556 |
| 01-01-2018 | 540 |
| 02-10-2019 | 531 |
| Other values (1745) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 6108950 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 27 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 25-07-2021 |
|---|---|
| 2nd row | 28-06-2022 |
| 3rd row | 11-03-2022 |
| 4th row | 15-05-2021 |
| 5th row | 28-06-2022 |
Common Values
| Value | Count | Frequency (%) |
| 26-06-2022 | 598 | 0.1% |
| 30-06-2019 | 578 | 0.1% |
| 01-08-2018 | 556 | 0.1% |
| 01-01-2018 | 540 | 0.1% |
| 02-10-2019 | 531 | 0.1% |
| 24-08-2018 | 528 | 0.1% |
| 01-02-2019 | 524 | 0.1% |
| 01-01-2020 | 519 | 0.1% |
| 03-04-2019 | 519 | 0.1% |
| 01-11-2019 | 515 | 0.1% |
| Other values (1740) | 605487 |
Length
| Value | Count | Frequency (%) |
| 26-06-2022 | 598 | 0.1% |
| 30-06-2019 | 578 | 0.1% |
| 01-08-2018 | 556 | 0.1% |
| 01-01-2018 | 540 | 0.1% |
| 02-10-2019 | 531 | 0.1% |
| 24-08-2018 | 528 | 0.1% |
| 01-02-2019 | 524 | 0.1% |
| 01-01-2020 | 519 | 0.1% |
| 03-04-2019 | 519 | 0.1% |
| 01-11-2019 | 515 | 0.1% |
| Other values (1740) | 605487 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1476602 | |
| 2 | 1363099 | |
| - | 1221790 | |
| 1 | 929933 | |
| 8 | 268586 | 4.4% |
| 9 | 251343 | 4.1% |
| 3 | 143945 | 2.4% |
| 7 | 115875 | 1.9% |
| 5 | 113384 | 1.9% |
| 6 | 112350 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4887160 | |
| Dash Punctuation | 1221790 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1476602 | |
| 2 | 1363099 | |
| 1 | 929933 | |
| 8 | 268586 | 5.5% |
| 9 | 251343 | 5.1% |
| 3 | 143945 | 2.9% |
| 7 | 115875 | 2.4% |
| 5 | 113384 | 2.3% |
| 6 | 112350 | 2.3% |
| 4 | 112043 | 2.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1221790 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6108950 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1476602 | |
| 2 | 1363099 | |
| - | 1221790 | |
| 1 | 929933 | |
| 8 | 268586 | 4.4% |
| 9 | 251343 | 4.1% |
| 3 | 143945 | 2.4% |
| 7 | 115875 | 1.9% |
| 5 | 113384 | 1.9% |
| 6 | 112350 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6108950 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1476602 | |
| 2 | 1363099 | |
| - | 1221790 | |
| 1 | 929933 | |
| 8 | 268586 | 4.4% |
| 9 | 251343 | 4.1% |
| 3 | 143945 | 2.4% |
| 7 | 115875 | 1.9% |
| 5 | 113384 | 1.9% |
| 6 | 112350 | 1.8% |
Incident Time
Categorical
| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 36.1 MiB |
| 00:00 | 17615 |
|---|---|
| 12:00 | 16484 |
| 18:00 | 12923 |
| 17:00 | 11808 |
| 20:00 | 11634 |
| Other values (1435) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3054475 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 00:00 |
|---|---|
| 2nd row | 23:58 |
| 3rd row | 10:30 |
| 4th row | 17:47 |
| 5th row | 17:22 |
Common Values
| Value | Count | Frequency (%) |
| 00:00 | 17615 | 2.9% |
| 12:00 | 16484 | 2.7% |
| 18:00 | 12923 | 2.1% |
| 17:00 | 11808 | 1.9% |
| 20:00 | 11634 | 1.9% |
| 19:00 | 11349 | 1.9% |
| 15:00 | 9991 | 1.6% |
| 21:00 | 9898 | 1.6% |
| 16:00 | 9775 | 1.6% |
| 22:00 | 9708 | 1.6% |
| Other values (1430) | 489710 |
Length
| Value | Count | Frequency (%) |
| 00:00 | 17615 | 2.9% |
| 12:00 | 16484 | 2.7% |
| 18:00 | 12923 | 2.1% |
| 17:00 | 11808 | 1.9% |
| 20:00 | 11634 | 1.9% |
| 19:00 | 11349 | 1.9% |
| 15:00 | 9991 | 1.6% |
| 21:00 | 9898 | 1.6% |
| 16:00 | 9775 | 1.6% |
| 22:00 | 9708 | 1.6% |
| Other values (1430) | 489710 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 838514 | |
| : | 610895 | |
| 1 | 507888 | |
| 2 | 267750 | 8.8% |
| 3 | 206041 | 6.7% |
| 5 | 179228 | 5.9% |
| 4 | 135629 | 4.4% |
| 8 | 84130 | 2.8% |
| 9 | 80595 | 2.6% |
| 7 | 75859 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2443580 | |
| Other Punctuation | 610895 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 838514 | |
| 1 | 507888 | |
| 2 | 267750 | 11.0% |
| 3 | 206041 | 8.4% |
| 5 | 179228 | 7.3% |
| 4 | 135629 | 5.6% |
| 8 | 84130 | 3.4% |
| 9 | 80595 | 3.3% |
| 7 | 75859 | 3.1% |
| 6 | 67946 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 610895 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3054475 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 838514 | |
| : | 610895 | |
| 1 | 507888 | |
| 2 | 267750 | 8.8% |
| 3 | 206041 | 6.7% |
| 5 | 179228 | 5.9% |
| 4 | 135629 | 4.4% |
| 8 | 84130 | 2.8% |
| 9 | 80595 | 2.6% |
| 7 | 75859 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3054475 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 838514 | |
| : | 610895 | |
| 1 | 507888 | |
| 2 | 267750 | 8.8% |
| 3 | 206041 | 6.7% |
| 5 | 179228 | 5.9% |
| 4 | 135629 | 4.4% |
| 8 | 84130 | 2.8% |
| 9 | 80595 | 2.6% |
| 7 | 75859 | 2.5% |
Incident Year
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2019.7531 |
| Minimum | 2018 |
|---|---|
| Maximum | 2023 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 2018 |
|---|---|
| 5-th percentile | 2018 |
| Q1 | 2019 |
| median | 2020 |
| Q3 | 2021 |
| 95-th percentile | 2022 |
| Maximum | 2023 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4010884 |
|---|---|
| Coefficient of variation (CV) | 0.00069369293 |
| Kurtosis | -1.2837967 |
| Mean | 2019.7531 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.21485761 |
| Sum | 1.233857 × 109 |
| Variance | 1.9630488 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2018 | 152475 | |
| 2019 | 148061 | |
| 2021 | 125833 | |
| 2020 | 96369 | |
| 2022 | 88148 | |
| 2023 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 2018 | 152475 | |
| 2019 | 148061 | |
| 2020 | 96369 | |
| 2021 | 125833 | |
| 2022 | 88148 | |
| 2023 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 2023 | 9 | < 0.1% |
| 2022 | 88148 | |
| 2021 | 125833 | |
| 2020 | 96369 | |
| 2019 | 148061 | |
| 2018 | 152475 |
Incident Day of Week
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 37.4 MiB |
| Friday | |
|---|---|
| Wednesday | |
| Monday | |
| Thursday | |
| Saturday | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.1528086 |
| Min length | 6 |
Characters and Unicode
| Total characters | 4369615 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sunday |
|---|---|
| 2nd row | Tuesday |
| 3rd row | Friday |
| 4th row | Saturday |
| 5th row | Tuesday |
Common Values
| Value | Count | Frequency (%) |
| Friday | 93304 | |
| Wednesday | 90560 | |
| Monday | 86911 | |
| Thursday | 86627 | |
| Saturday | 86463 | |
| Tuesday | 86385 | |
| Sunday | 80645 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| friday | 93304 | |
| wednesday | 90560 | |
| monday | 86911 | |
| thursday | 86627 | |
| saturday | 86463 | |
| tuesday | 86385 | |
| sunday | 80645 |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 701455 | |
| a | 697358 | |
| y | 610895 | |
| u | 340120 | |
| e | 267505 | 6.1% |
| r | 266394 | 6.1% |
| s | 263572 | 6.0% |
| n | 258116 | 5.9% |
| T | 173012 | 4.0% |
| S | 167108 | 3.8% |
| Other values (7) | 624080 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3758720 | |
| Uppercase Letter | 610895 | 14.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 701455 | |
| a | 697358 | |
| y | 610895 | |
| u | 340120 | |
| e | 267505 | 7.1% |
| r | 266394 | 7.1% |
| s | 263572 | 7.0% |
| n | 258116 | 6.9% |
| i | 93304 | 2.5% |
| o | 86911 | 2.3% |
| Other values (2) | 173090 | 4.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 173012 | |
| S | 167108 | |
| F | 93304 | |
| W | 90560 | |
| M | 86911 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4369615 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 701455 | |
| a | 697358 | |
| y | 610895 | |
| u | 340120 | |
| e | 267505 | 6.1% |
| r | 266394 | 6.1% |
| s | 263572 | 6.0% |
| n | 258116 | 5.9% |
| T | 173012 | 4.0% |
| S | 167108 | 3.8% |
| Other values (7) | 624080 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4369615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| d | 701455 | |
| a | 697358 | |
| y | 610895 | |
| u | 340120 | |
| e | 267505 | 6.1% |
| r | 266394 | 6.1% |
| s | 263572 | 6.0% |
| n | 258116 | 5.9% |
| T | 173012 | 4.0% |
| S | 167108 | 3.8% |
| Other values (7) | 624080 |
Report Datetime
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 438062 |
|---|---|
| Distinct (%) | 71.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.5 MiB |
| 23-11-2021 13:00 | 90 |
|---|---|
| 19-04-2022 03:30 | 55 |
| 27-06-2018 07:30 | 48 |
| 27-02-2019 05:19 | 34 |
| 10-10-2019 12:00 | 33 |
| Other values (438057) |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 16 |
| Min length | 16 |
Characters and Unicode
| Total characters | 9774320 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 320472 ? |
|---|---|
| Unique (%) | 52.5% |
Sample
| 1st row | 25-07-2021 13:41 |
|---|---|
| 2nd row | 28-06-2022 23:58 |
| 3rd row | 11-03-2022 20:03 |
| 4th row | 15-05-2021 17:47 |
| 5th row | 28-06-2022 17:22 |
Common Values
| Value | Count | Frequency (%) |
| 23-11-2021 13:00 | 90 | < 0.1% |
| 19-04-2022 03:30 | 55 | < 0.1% |
| 27-06-2018 07:30 | 48 | < 0.1% |
| 27-02-2019 05:19 | 34 | < 0.1% |
| 10-10-2019 12:00 | 33 | < 0.1% |
| 08-02-2018 19:23 | 26 | < 0.1% |
| 02-02-2019 14:13 | 24 | < 0.1% |
| 26-06-2018 10:27 | 21 | < 0.1% |
| 07-12-2018 13:00 | 21 | < 0.1% |
| 01-07-2019 18:00 | 21 | < 0.1% |
| Other values (438052) | 610522 |
Length
| Value | Count | Frequency (%) |
| 13:00 | 1959 | 0.2% |
| 15:00 | 1892 | 0.2% |
| 14:00 | 1891 | 0.2% |
| 12:00 | 1872 | 0.2% |
| 16:00 | 1772 | 0.1% |
| 11:00 | 1711 | 0.1% |
| 17:00 | 1608 | 0.1% |
| 10:00 | 1599 | 0.1% |
| 15:30 | 1481 | 0.1% |
| 09:00 | 1445 | 0.1% |
| Other values (3183) | 1204560 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1905759 | |
| 2 | 1694108 | |
| 1 | 1525990 | |
| - | 1221790 | |
| 610895 | 6.2% | |
| : | 610895 | 6.2% |
| 8 | 378440 | 3.9% |
| 3 | 371587 | 3.8% |
| 9 | 368049 | 3.8% |
| 5 | 328675 | 3.4% |
| Other values (3) | 758132 | 7.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7330740 | |
| Dash Punctuation | 1221790 | 12.5% |
| Space Separator | 610895 | 6.2% |
| Other Punctuation | 610895 | 6.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1905759 | |
| 2 | 1694108 | |
| 1 | 1525990 | |
| 8 | 378440 | 5.2% |
| 3 | 371587 | 5.1% |
| 9 | 368049 | 5.0% |
| 5 | 328675 | 4.5% |
| 4 | 313130 | 4.3% |
| 7 | 225959 | 3.1% |
| 6 | 219043 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1221790 |
Space Separator
| Value | Count | Frequency (%) |
| 610895 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 610895 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9774320 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1905759 | |
| 2 | 1694108 | |
| 1 | 1525990 | |
| - | 1221790 | |
| 610895 | 6.2% | |
| : | 610895 | 6.2% |
| 8 | 378440 | 3.9% |
| 3 | 371587 | 3.8% |
| 9 | 368049 | 3.8% |
| 5 | 328675 | 3.4% |
| Other values (3) | 758132 | 7.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9774320 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1905759 | |
| 2 | 1694108 | |
| 1 | 1525990 | |
| - | 1221790 | |
| 610895 | 6.2% | |
| : | 610895 | 6.2% |
| 8 | 378440 | 3.9% |
| 3 | 371587 | 3.8% |
| 9 | 368049 | 3.8% |
| 5 | 328675 | 3.4% |
| Other values (3) | 758132 | 7.8% |
Row ID
Real number (ℝ)
| Distinct | 419011 |
|---|---|
| Distinct (%) | 68.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.0305426 × 1010 |
| Minimum | 6.1868707 × 1010 |
|---|---|
| Maximum | 1.23624 × 1011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 6.1868707 × 1010 |
|---|---|
| 5-th percentile | 6.4824482 × 1010 |
| Q1 | 7.5786316 × 1010 |
| median | 8.9439026 × 1010 |
| Q3 | 1.05262 × 1011 |
| 95-th percentile | 1.16349 × 1011 |
| Maximum | 1.23624 × 1011 |
| Range | 6.1755293 × 1010 |
| Interquartile range (IQR) | 2.9475684 × 1010 |
Descriptive statistics
| Standard deviation | 1.6706844 × 1010 |
|---|---|
| Coefficient of variation (CV) | 0.18500376 |
| Kurtosis | -1.2423387 |
| Mean | 9.0305426 × 1010 |
| Median Absolute Deviation (MAD) | 1.4736974 × 1010 |
| Skewness | 0.043507348 |
| Sum | 5.5167133 × 1016 |
| Variance | 2.7911863 × 1020 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.05983 × 1011 | 21 | < 0.1% |
| 1.00616 × 1011 | 21 | < 0.1% |
| 1.17265 × 1011 | 20 | < 0.1% |
| 1.09456 × 1011 | 20 | < 0.1% |
| 1.17418 × 1011 | 20 | < 0.1% |
| 1.09985 × 1011 | 20 | < 0.1% |
| 1.00054 × 1011 | 20 | < 0.1% |
| 1.01869 × 1011 | 20 | < 0.1% |
| 1.12407 × 1011 | 19 | < 0.1% |
| 1.18621 × 1011 | 19 | < 0.1% |
| Other values (419001) | 610695 |
| Value | Count | Frequency (%) |
| 6.186870704 × 1010 | 1 | |
| 6.186910413 × 1010 | 1 | |
| 6.18691153 × 1010 | 1 | |
| 6.186970611 × 1010 | 1 | |
| 6.18699121 × 1010 | 1 | |
| 6.187010705 × 1010 | 1 | |
| 6.187016501 × 1010 | 1 | |
| 6.187016505 × 1010 | 1 | |
| 6.187020307 × 1010 | 1 | |
| 6.1870768 × 1010 | 1 |
| Value | Count | Frequency (%) |
| 1.23624 × 1011 | 1 | |
| 1.23607 × 1011 | 1 | |
| 1.2347 × 1011 | 1 | |
| 1.23424 × 1011 | 1 | |
| 1.23375 × 1011 | 1 | |
| 1.23355 × 1011 | 1 | |
| 1.2328 × 1011 | 1 | |
| 1.23239 × 1011 | 1 | |
| 1.23215 × 1011 | 1 | |
| 1.23161 × 1011 | 1 |
Incident ID
Real number (ℝ)
| Distinct | 512521 |
|---|---|
| Distinct (%) | 83.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 903053.92 |
| Minimum | 618687 |
|---|---|
| Maximum | 1236239 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 618687 |
|---|---|
| 5-th percentile | 648244.7 |
| Q1 | 757863 |
| median | 894390 |
| Q3 | 1052616 |
| 95-th percentile | 1163489.3 |
| Maximum | 1236239 |
| Range | 617552 |
| Interquartile range (IQR) | 294753 |
Descriptive statistics
| Standard deviation | 167068.34 |
|---|---|
| Coefficient of variation (CV) | 0.18500373 |
| Kurtosis | -1.2423385 |
| Mean | 903053.92 |
| Median Absolute Deviation (MAD) | 147368 |
| Skewness | 0.043506717 |
| Sum | 5.5167113 × 1011 |
| Variance | 2.7911831 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 640948 | 4 | < 0.1% |
| 693983 | 4 | < 0.1% |
| 908319 | 4 | < 0.1% |
| 1028735 | 4 | < 0.1% |
| 944519 | 4 | < 0.1% |
| 1078388 | 4 | < 0.1% |
| 960299 | 4 | < 0.1% |
| 884391 | 4 | < 0.1% |
| 689199 | 4 | < 0.1% |
| 632466 | 4 | < 0.1% |
| Other values (512511) | 610855 |
| Value | Count | Frequency (%) |
| 618687 | 1 | < 0.1% |
| 618691 | 2 | |
| 618697 | 1 | < 0.1% |
| 618699 | 1 | < 0.1% |
| 618701 | 3 | |
| 618702 | 1 | < 0.1% |
| 618707 | 1 | < 0.1% |
| 618709 | 2 | |
| 618710 | 3 | |
| 618711 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1236239 | 1 | |
| 1236072 | 1 | |
| 1234703 | 1 | |
| 1234236 | 1 | |
| 1233754 | 1 | |
| 1233547 | 1 | |
| 1232800 | 1 | |
| 1232386 | 1 | |
| 1232150 | 1 | |
| 1231607 | 1 |
Incident Number
Real number (ℝ)
| Distinct | 445575 |
|---|---|
| Distinct (%) | 72.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9910225 × 108 |
| Minimum | 0 |
|---|---|
| Maximum | 9.8142426 × 108 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.8024617 × 108 |
| Q1 | 1.9000774 × 108 |
| median | 2.0004958 × 108 |
| Q3 | 2.1052825 × 108 |
| 95-th percentile | 2.2049413 × 108 |
| Maximum | 9.8142426 × 108 |
| Range | 9.8142426 × 108 |
| Interquartile range (IQR) | 20520509 |
Descriptive statistics
| Standard deviation | 14541705 |
|---|---|
| Coefficient of variation (CV) | 0.07303637 |
| Kurtosis | 71.569407 |
| Mean | 1.9910225 × 108 |
| Median Absolute Deviation (MAD) | 10471194 |
| Skewness | 1.4657063 |
| Sum | 1.2163057 × 1014 |
| Variance | 2.1146119 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 190202001 | 52 | < 0.1% |
| 210330778 | 47 | < 0.1% |
| 190071345 | 34 | < 0.1% |
| 210394611 | 28 | < 0.1% |
| 210505959 | 21 | < 0.1% |
| 200080808 | 21 | < 0.1% |
| 190129518 | 20 | < 0.1% |
| 180354292 | 20 | < 0.1% |
| 190176490 | 18 | < 0.1% |
| 210724717 | 17 | < 0.1% |
| Other values (445565) | 610617 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 1131000 | 1 | |
| 1808670 | 1 | |
| 1813494 | 1 | |
| 1819855 | 1 | |
| 1819873 | 1 | |
| 1831758 | 1 | |
| 1831875 | 1 | |
| 2000558 | 1 | |
| 2001459 | 1 |
| Value | Count | Frequency (%) |
| 981424262 | 1 | < 0.1% |
| 981171996 | 1 | < 0.1% |
| 970332979 | 1 | < 0.1% |
| 940072058 | 1 | < 0.1% |
| 793282725 | 1 | < 0.1% |
| 782312915 | 3 | |
| 700013570 | 1 | < 0.1% |
| 270762961 | 1 | < 0.1% |
| 251030935 | 1 | < 0.1% |
| 236011063 | 1 | < 0.1% |
CAD Number
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 351483 |
|---|---|
| Distinct (%) | 74.2% |
| Missing | 137235 |
| Missing (%) | 22.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9993409 × 108 |
| Minimum | 1 |
|---|---|
| Maximum | 1 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1.8079001 × 108 |
| Q1 | 1.9013253 × 108 |
| median | 2.0030147 × 108 |
| Q3 | 2.1196002 × 108 |
| 95-th percentile | 2.2171215 × 108 |
| Maximum | 1 × 109 |
| Range | 1 × 109 |
| Interquartile range (IQR) | 21827496 |
Descriptive statistics
| Standard deviation | 22492841 |
|---|---|
| Coefficient of variation (CV) | 0.11250128 |
| Kurtosis | 770.47581 |
| Mean | 1.9993409 × 108 |
| Median Absolute Deviation (MAD) | 11501008 |
| Skewness | 21.756041 |
| Sum | 9.4700782 × 1013 |
| Variance | 5.0592788 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 999999999 | 225 | < 0.1% |
| 190531525 | 35 | < 0.1% |
| 180393444 | 26 | < 0.1% |
| 200453221 | 23 | < 0.1% |
| 213631865 | 22 | < 0.1% |
| 180681633 | 20 | < 0.1% |
| 211483573 | 20 | < 0.1% |
| 220292654 | 18 | < 0.1% |
| 213081374 | 17 | < 0.1% |
| 220271692 | 17 | < 0.1% |
| Other values (351473) | 473237 | |
| (Missing) | 137235 | 22.5% |
| Value | Count | Frequency (%) |
| 1 | 6 | |
| 18012428 | 1 | < 0.1% |
| 18165248 | 1 | < 0.1% |
| 18237257 | 1 | < 0.1% |
| 18303153 | 3 | |
| 19122287 | 2 | < 0.1% |
| 20085321 | 1 | < 0.1% |
| 20188282 | 1 | < 0.1% |
| 21164231 | 1 | < 0.1% |
| 22210522 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 999999999 | 225 | |
| 999990999 | 2 | < 0.1% |
| 982560450 | 1 | < 0.1% |
| 818632476 | 2 | < 0.1% |
| 628336061 | 1 | < 0.1% |
| 418221357 | 2 | < 0.1% |
| 400083487 | 1 | < 0.1% |
| 303181516 | 1 | < 0.1% |
| 301580764 | 1 | < 0.1% |
| 290002510 | 1 | < 0.1% |
Report Type Code
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.4 MiB |
| II | |
|---|---|
| IS | |
| VI | 37608 |
| VS | 25519 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1221790 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | II |
|---|---|
| 2nd row | VS |
| 3rd row | II |
| 4th row | VS |
| 5th row | VS |
Common Values
| Value | Count | Frequency (%) |
| II | 483861 | |
| IS | 63907 | 10.5% |
| VI | 37608 | 6.2% |
| VS | 25519 | 4.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ii | 483861 | |
| is | 63907 | 10.5% |
| vi | 37608 | 6.2% |
| vs | 25519 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 1069237 | |
| S | 89426 | 7.3% |
| V | 63127 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1221790 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1069237 | |
| S | 89426 | 7.3% |
| V | 63127 | 5.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1221790 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 1069237 | |
| S | 89426 | 7.3% |
| V | 63127 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1221790 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 1069237 | |
| S | 89426 | 7.3% |
| V | 63127 | 5.2% |
Report Type Description
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.5 MiB |
| Initial | |
|---|---|
| Coplogic Initial | |
| Initial Supplement | |
| Vehicle Initial | |
| Vehicle Supplement | 25519 |
Length
| Max length | 19 |
|---|---|
| Median length | 7 |
| Mean length | 10.750663 |
| Min length | 7 |
Characters and Unicode
| Total characters | 6567526 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Coplogic Initial |
|---|---|
| 2nd row | Vehicle Supplement |
| 3rd row | Coplogic Initial |
| 4th row | Vehicle Supplement |
| 5th row | Vehicle Supplement |
Common Values
| Value | Count | Frequency (%) |
| Initial | 373544 | |
| Coplogic Initial | 110317 | 18.1% |
| Initial Supplement | 50049 | 8.2% |
| Vehicle Initial | 37608 | 6.2% |
| Vehicle Supplement | 25519 | 4.2% |
| Coplogic Supplement | 13858 | 2.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| initial | 571518 | |
| coplogic | 124175 | 14.6% |
| supplement | 89426 | 10.5% |
| vehicle | 63127 | 7.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1330338 | |
| l | 848246 | |
| t | 660944 | |
| n | 660944 | |
| I | 571518 | |
| a | 571518 | |
| e | 305106 | 4.6% |
| p | 303027 | 4.6% |
| o | 248350 | 3.8% |
| 237351 | 3.6% | |
| Other values (8) | 830184 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5481929 | |
| Uppercase Letter | 848246 | 12.9% |
| Space Separator | 237351 | 3.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1330338 | |
| l | 848246 | |
| t | 660944 | |
| n | 660944 | |
| a | 571518 | |
| e | 305106 | 5.6% |
| p | 303027 | 5.5% |
| o | 248350 | 4.5% |
| c | 187302 | 3.4% |
| g | 124175 | 2.3% |
| Other values (3) | 241979 | 4.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 571518 | |
| C | 124175 | 14.6% |
| S | 89426 | 10.5% |
| V | 63127 | 7.4% |
Space Separator
| Value | Count | Frequency (%) |
| 237351 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6330175 | |
| Common | 237351 | 3.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1330338 | |
| l | 848246 | |
| t | 660944 | |
| n | 660944 | |
| I | 571518 | |
| a | 571518 | |
| e | 305106 | 4.8% |
| p | 303027 | 4.8% |
| o | 248350 | 3.9% |
| c | 187302 | 3.0% |
| Other values (7) | 642882 |
Common
| Value | Count | Frequency (%) |
| 237351 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6567526 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1330338 | |
| l | 848246 | |
| t | 660944 | |
| n | 660944 | |
| I | 571518 | |
| a | 571518 | |
| e | 305106 | 4.6% |
| p | 303027 | 4.6% |
| o | 248350 | 3.8% |
| 237351 | 3.6% | |
| Other values (8) | 830184 |
Filed Online
Boolean
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 486720 |
| Missing (%) | 79.7% |
| Memory size | 19.1 MiB |
| True | |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| True | 124175 | 20.3% |
| (Missing) | 486720 |
Incident Code
Real number (ℝ)
| Distinct | 832 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24735.751 |
| Minimum | 1000 |
|---|---|
| Maximum | 75030 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 4134 |
| Q1 | 6244 |
| median | 7041 |
| Q3 | 51040 |
| 95-th percentile | 71024 |
| Maximum | 75030 |
| Range | 74030 |
| Interquartile range (IQR) | 44796 |
Descriptive statistics
| Standard deviation | 25703.749 |
|---|---|
| Coefficient of variation (CV) | 1.0391335 |
| Kurtosis | -0.84179728 |
| Mean | 24735.751 |
| Median Absolute Deviation (MAD) | 2907 |
| Skewness | 0.95103562 |
| Sum | 1.5110947 × 1010 |
| Variance | 6.606827 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6244 | 78103 | 12.8% |
| 28150 | 20379 | 3.3% |
| 71000 | 18383 | 3.0% |
| 4134 | 17928 | 2.9% |
| 6372 | 17433 | 2.9% |
| 7041 | 16716 | 2.7% |
| 7021 | 16136 | 2.6% |
| 6374 | 14547 | 2.4% |
| 64020 | 13831 | 2.3% |
| 28160 | 11333 | 1.9% |
| Other values (822) | 386106 |
| Value | Count | Frequency (%) |
| 1000 | 8 | < 0.1% |
| 1001 | 11 | < 0.1% |
| 1002 | 4 | < 0.1% |
| 1003 | 4 | < 0.1% |
| 1004 | 2 | < 0.1% |
| 1005 | 1 | < 0.1% |
| 1160 | 45 | |
| 2001 | 1 | < 0.1% |
| 2002 | 1 | < 0.1% |
| 2003 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 75030 | 2497 | 0.4% |
| 75025 | 2204 | 0.4% |
| 75011 | 8 | < 0.1% |
| 75000 | 6649 | |
| 74024 | 3 | < 0.1% |
| 74020 | 13 | < 0.1% |
| 74000 | 6673 | |
| 73010 | 514 | 0.1% |
| 73001 | 19 | < 0.1% |
| 73000 | 167 | < 0.1% |
Incident Category
Categorical
| Distinct | 49 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 495 |
| Missing (%) | 0.1% |
| Memory size | 41.2 MiB |
| Larceny Theft | |
|---|---|
| Other Miscellaneous | |
| Malicious Mischief | |
| Assault | |
| Non-Criminal | |
| Other values (44) |
Length
| Max length | 44 |
|---|---|
| Median length | 40 |
| Mean length | 13.705729 |
| Min length | 4 |
Characters and Unicode
| Total characters | 8365977 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Larceny Theft |
|---|---|
| 2nd row | Other Offenses |
| 3rd row | Lost Property |
| 4th row | Recovered Vehicle |
| 5th row | Recovered Vehicle |
Common Values
| Value | Count | Frequency (%) |
| Larceny Theft | 187395 | |
| Other Miscellaneous | 43264 | 7.1% |
| Malicious Mischief | 41220 | 6.7% |
| Assault | 37003 | 6.1% |
| Non-Criminal | 36842 | 6.0% |
| Burglary | 33837 | 5.5% |
| Motor Vehicle Theft | 29882 | 4.9% |
| Recovered Vehicle | 22925 | 3.8% |
| Fraud | 19052 | 3.1% |
| Lost Property | 18383 | 3.0% |
| Other values (39) | 140597 |
Length
| Value | Count | Frequency (%) |
| theft | 217345 | |
| larceny | 187395 | |
| vehicle | 53529 | 4.7% |
| other | 53204 | 4.7% |
| miscellaneous | 49429 | 4.4% |
| malicious | 41220 | 3.7% |
| mischief | 41220 | 3.7% |
| assault | 37003 | 3.3% |
| non-criminal | 36842 | 3.3% |
| burglary | 33837 | 3.0% |
| Other values (62) | 377193 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 960517 | 11.5% |
| r | 611292 | 7.3% |
| 517817 | 6.2% | |
| a | 496201 | 5.9% |
| i | 480020 | 5.7% |
| n | 470069 | 5.6% |
| c | 463792 | 5.5% |
| t | 458015 | 5.5% |
| s | 428611 | 5.1% |
| h | 382288 | 4.6% |
| Other values (40) | 3097355 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6645810 | |
| Uppercase Letter | 1165059 | 13.9% |
| Space Separator | 517817 | 6.2% |
| Dash Punctuation | 36842 | 0.4% |
| Other Punctuation | 207 | < 0.1% |
| Open Punctuation | 121 | < 0.1% |
| Close Punctuation | 121 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 960517 | |
| r | 611292 | 9.2% |
| a | 496201 | 7.5% |
| i | 480020 | 7.2% |
| n | 470069 | 7.1% |
| c | 463792 | 7.0% |
| t | 458015 | 6.9% |
| s | 428611 | 6.4% |
| h | 382288 | 5.8% |
| o | 357746 | 5.4% |
| Other values (15) | 1537259 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 233514 | |
| L | 205906 | |
| M | 175395 | |
| O | 98610 | |
| C | 70307 | 6.0% |
| A | 64297 | 5.5% |
| V | 61170 | 5.3% |
| R | 39467 | 3.4% |
| N | 36842 | 3.2% |
| P | 35664 | 3.1% |
| Other values (9) | 143887 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 139 | |
| ? | 68 |
Space Separator
| Value | Count | Frequency (%) |
| 517817 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 36842 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 121 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 121 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7810869 | |
| Common | 555108 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 960517 | 12.3% |
| r | 611292 | 7.8% |
| a | 496201 | 6.4% |
| i | 480020 | 6.1% |
| n | 470069 | 6.0% |
| c | 463792 | 5.9% |
| t | 458015 | 5.9% |
| s | 428611 | 5.5% |
| h | 382288 | 4.9% |
| o | 357746 | 4.6% |
| Other values (34) | 2702318 |
Common
| Value | Count | Frequency (%) |
| 517817 | ||
| - | 36842 | 6.6% |
| , | 139 | < 0.1% |
| ( | 121 | < 0.1% |
| ) | 121 | < 0.1% |
| ? | 68 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8365977 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 960517 | 11.5% |
| r | 611292 | 7.3% |
| 517817 | 6.2% | |
| a | 496201 | 5.9% |
| i | 480020 | 5.7% |
| n | 470069 | 5.6% |
| c | 463792 | 5.5% |
| t | 458015 | 5.5% |
| s | 428611 | 5.1% |
| h | 382288 | 4.6% |
| Other values (40) | 3097355 |
Incident Subcategory
Categorical
HIGH CARDINALITY  HIGH CORRELATION 
| Distinct | 71 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 495 |
| Missing (%) | 0.1% |
| Memory size | 42.3 MiB |
| Larceny - From Vehicle | |
|---|---|
| Other | |
| Larceny Theft - Other | |
| Vandalism | |
| Motor Vehicle Theft | 29484 |
| Other values (66) |
Length
| Max length | 40 |
|---|---|
| Median length | 29 |
| Mean length | 15.578242 |
| Min length | 4 |
Characters and Unicode
| Total characters | 9508959 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Larceny Theft - Other |
|---|---|
| 2nd row | Other Offenses |
| 3rd row | Lost Property |
| 4th row | Recovered Vehicle |
| 5th row | Recovered Vehicle |
Common Values
| Value | Count | Frequency (%) |
| Larceny - From Vehicle | 106475 | |
| Other | 77365 | 12.7% |
| Larceny Theft - Other | 43356 | 7.1% |
| Vandalism | 40882 | 6.7% |
| Motor Vehicle Theft | 29484 | 4.8% |
| Simple Assault | 23050 | 3.8% |
| Recovered Vehicle | 22925 | 3.8% |
| Non-Criminal | 21004 | 3.4% |
| Fraud | 19980 | 3.3% |
| Lost Property | 18383 | 3.0% |
| Other values (61) | 207496 |
Length
| Value | Count | Frequency (%) |
| 227525 | ||
| larceny | 178473 | |
| vehicle | 168930 | |
| other | 144441 | 9.6% |
| from | 125630 | 8.4% |
| theft | 109004 | 7.2% |
| vandalism | 40882 | 2.7% |
| assault | 37004 | 2.5% |
| burglary | 33837 | 2.2% |
| motor | 29950 | 2.0% |
| Other values (81) | 408339 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1050525 | 11.0% |
| 893615 | 9.4% | |
| r | 785689 | 8.3% |
| i | 515380 | 5.4% |
| a | 507130 | 5.3% |
| t | 504543 | 5.3% |
| c | 442465 | 4.7% |
| h | 434534 | 4.6% |
| o | 431186 | 4.5% |
| l | 425520 | 4.5% |
| Other values (42) | 3518372 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7067732 | |
| Uppercase Letter | 1297753 | 13.6% |
| Space Separator | 893615 | 9.4% |
| Dash Punctuation | 248082 | 2.6% |
| Other Punctuation | 845 | < 0.1% |
| Open Punctuation | 466 | < 0.1% |
| Close Punctuation | 466 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1050525 | |
| r | 785689 | |
| i | 515380 | 7.3% |
| a | 507130 | 7.2% |
| t | 504543 | 7.1% |
| c | 442465 | 6.3% |
| h | 434534 | 6.1% |
| o | 431186 | 6.1% |
| l | 425520 | 6.0% |
| n | 420259 | 5.9% |
| Other values (16) | 1550501 |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 231435 | |
| L | 198676 | |
| O | 168735 | |
| F | 148630 | |
| T | 122769 | |
| A | 70202 | 5.4% |
| S | 55139 | 4.2% |
| R | 52001 | 4.0% |
| M | 49726 | 3.8% |
| B | 46680 | 3.6% |
| Other values (10) | 153760 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 706 | |
| , | 139 | 16.4% |
Space Separator
| Value | Count | Frequency (%) |
| 893615 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 248082 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 466 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 466 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8365485 | |
| Common | 1143474 | 12.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1050525 | 12.6% |
| r | 785689 | 9.4% |
| i | 515380 | 6.2% |
| a | 507130 | 6.1% |
| t | 504543 | 6.0% |
| c | 442465 | 5.3% |
| h | 434534 | 5.2% |
| o | 431186 | 5.2% |
| l | 425520 | 5.1% |
| n | 420259 | 5.0% |
| Other values (36) | 2848254 |
Common
| Value | Count | Frequency (%) |
| 893615 | ||
| - | 248082 | 21.7% |
| & | 706 | 0.1% |
| ( | 466 | < 0.1% |
| ) | 466 | < 0.1% |
| , | 139 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9508959 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1050525 | 11.0% |
| 893615 | 9.4% | |
| r | 785689 | 8.3% |
| i | 515380 | 5.4% |
| a | 507130 | 5.3% |
| t | 504543 | 5.3% |
| c | 442465 | 4.7% |
| h | 434534 | 4.6% |
| o | 431186 | 4.5% |
| l | 425520 | 4.5% |
| Other values (42) | 3518372 |
Incident Description
Categorical
| Distinct | 829 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.3 MiB |
| Theft, From Locked Vehicle, >$950 | |
|---|---|
| Malicious Mischief, Vandalism to Property | 20379 |
| Lost Property | 18383 |
| Battery | 17928 |
| Theft, Other Property, $50-$200 | 17433 |
| Other values (824) |
Length
| Max length | 84 |
|---|---|
| Median length | 58 |
| Mean length | 29.320744 |
| Min length | 4 |
Characters and Unicode
| Total characters | 17911896 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 82 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Theft, Other Property, $50-$200 |
|---|---|
| 2nd row | License Plate, Recovered |
| 3rd row | Lost Property |
| 4th row | Vehicle, Recovered, Motorcycle |
| 5th row | Vehicle, Recovered, Auto |
Common Values
| Value | Count | Frequency (%) |
| Theft, From Locked Vehicle, >$950 | 78103 | 12.8% |
| Malicious Mischief, Vandalism to Property | 20379 | 3.3% |
| Lost Property | 18383 | 3.0% |
| Battery | 17928 | 2.9% |
| Theft, Other Property, $50-$200 | 17433 | 2.9% |
| Vehicle, Recovered, Auto | 16716 | 2.7% |
| Vehicle, Stolen, Auto | 16136 | 2.6% |
| Theft, Other Property, >$950 | 14547 | 2.4% |
| Mental Health Detention | 13831 | 2.3% |
| Malicious Mischief, Vandalism to Vehicle | 11333 | 1.9% |
| Other values (819) | 386106 |
Length
| Value | Count | Frequency (%) |
| vehicle | 180603 | 7.4% |
| theft | 179865 | 7.4% |
| from | 122383 | 5.0% |
| 950 | 116701 | 4.8% |
| property | 97563 | 4.0% |
| locked | 92886 | 3.8% |
| other | 58170 | 2.4% |
| to | 46991 | 1.9% |
| stolen | 40618 | 1.7% |
| malicious | 39217 | 1.6% |
| Other values (912) | 1451277 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1830806 | 10.2% | |
| e | 1801589 | 10.1% |
| r | 1076202 | 6.0% |
| o | 1071925 | 6.0% |
| t | 1060662 | 5.9% |
| i | 950707 | 5.3% |
| , | 754953 | 4.2% |
| c | 690176 | 3.9% |
| n | 689963 | 3.9% |
| l | 669866 | 3.7% |
| Other values (63) | 7315047 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12066864 | |
| Uppercase Letter | 2138410 | 11.9% |
| Space Separator | 1830806 | 10.2% |
| Other Punctuation | 814089 | 4.5% |
| Decimal Number | 633473 | 3.5% |
| Currency Symbol | 222273 | 1.2% |
| Math Symbol | 123371 | 0.7% |
| Dash Punctuation | 56298 | 0.3% |
| Open Punctuation | 13156 | 0.1% |
| Close Punctuation | 13156 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1801589 | |
| r | 1076202 | 8.9% |
| o | 1071925 | 8.9% |
| t | 1060662 | 8.8% |
| i | 950707 | 7.9% |
| c | 690176 | 5.7% |
| n | 689963 | 5.7% |
| l | 669866 | 5.6% |
| a | 632693 | 5.2% |
| s | 592072 | 4.9% |
| Other values (16) | 2831009 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 243672 | |
| V | 232734 | |
| F | 221075 | |
| P | 202608 | |
| L | 148509 | 6.9% |
| A | 143141 | 6.7% |
| M | 132149 | 6.2% |
| S | 123931 | 5.8% |
| O | 107231 | 5.0% |
| B | 93895 | 4.4% |
| Other values (13) | 489465 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 271892 | |
| 5 | 172817 | |
| 9 | 139051 | |
| 2 | 49457 | 7.8% |
| 1 | 185 | < 0.1% |
| 3 | 23 | < 0.1% |
| 6 | 20 | < 0.1% |
| 8 | 17 | < 0.1% |
| 4 | 9 | < 0.1% |
| 7 | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 754953 | |
| / | 29014 | 3.6% |
| . | 26138 | 3.2% |
| & | 3903 | 0.5% |
| " | 72 | < 0.1% |
| ' | 5 | < 0.1% |
| ; | 4 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 116701 | |
| < | 6670 | 5.4% |
Space Separator
| Value | Count | Frequency (%) |
| 1830806 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 222273 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 56298 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13156 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13156 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14205274 | |
| Common | 3706622 | 20.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1801589 | 12.7% |
| r | 1076202 | 7.6% |
| o | 1071925 | 7.5% |
| t | 1060662 | 7.5% |
| i | 950707 | 6.7% |
| c | 690176 | 4.9% |
| n | 689963 | 4.9% |
| l | 669866 | 4.7% |
| a | 632693 | 4.5% |
| s | 592072 | 4.2% |
| Other values (39) | 4969419 |
Common
| Value | Count | Frequency (%) |
| 1830806 | ||
| , | 754953 | |
| 0 | 271892 | 7.3% |
| $ | 222273 | 6.0% |
| 5 | 172817 | 4.7% |
| 9 | 139051 | 3.8% |
| > | 116701 | 3.1% |
| - | 56298 | 1.5% |
| 2 | 49457 | 1.3% |
| / | 29014 | 0.8% |
| Other values (14) | 63360 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17911896 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1830806 | 10.2% | |
| e | 1801589 | 10.1% |
| r | 1076202 | 6.0% |
| o | 1071925 | 6.0% |
| t | 1060662 | 5.9% |
| i | 950707 | 5.3% |
| , | 754953 | 4.2% |
| c | 690176 | 3.9% |
| n | 689963 | 3.9% |
| l | 669866 | 3.7% |
| Other values (63) | 7315047 |
Resolution
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.0 MiB |
| Open or Active | |
|---|---|
| Cite or Arrest Adult | |
| Unfounded | 3403 |
| Exceptional Adult | 1592 |
Length
| Max length | 20 |
|---|---|
| Median length | 14 |
| Mean length | 15.14885 |
| Min length | 9 |
Characters and Unicode
| Total characters | 9254357 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Open or Active |
|---|---|
| 2nd row | Open or Active |
| 3rd row | Open or Active |
| 4th row | Open or Active |
| 5th row | Open or Active |
Common Values
| Value | Count | Frequency (%) |
| Open or Active | 486889 | |
| Cite or Arrest Adult | 119011 | 19.5% |
| Unfounded | 3403 | 0.6% |
| Exceptional Adult | 1592 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| or | 605900 | |
| open | 486889 | |
| active | 486889 | |
| adult | 120603 | 6.2% |
| cite | 119011 | 6.1% |
| arrest | 119011 | 6.1% |
| unfounded | 3403 | 0.2% |
| exceptional | 1592 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1332403 | ||
| e | 1216795 | |
| t | 847106 | |
| r | 843922 | |
| A | 726503 | |
| o | 610895 | 6.6% |
| i | 607492 | 6.6% |
| n | 495287 | 5.4% |
| p | 488481 | 5.3% |
| c | 488481 | 5.3% |
| Other values (12) | 1596992 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6584556 | |
| Uppercase Letter | 1337398 | 14.5% |
| Space Separator | 1332403 | 14.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1216795 | |
| t | 847106 | |
| r | 843922 | |
| o | 610895 | |
| i | 607492 | |
| n | 495287 | |
| p | 488481 | |
| c | 488481 | |
| v | 486889 | |
| d | 127409 | 1.9% |
| Other values (6) | 371799 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 726503 | |
| O | 486889 | |
| C | 119011 | 8.9% |
| U | 3403 | 0.3% |
| E | 1592 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1332403 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7921954 | |
| Common | 1332403 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1216795 | |
| t | 847106 | |
| r | 843922 | |
| A | 726503 | |
| o | 610895 | |
| i | 607492 | |
| n | 495287 | |
| p | 488481 | |
| c | 488481 | |
| O | 486889 | |
| Other values (11) | 1110103 |
Common
| Value | Count | Frequency (%) |
| 1332403 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9254357 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1332403 | ||
| e | 1216795 | |
| t | 847106 | |
| r | 843922 | |
| A | 726503 | |
| o | 610895 | 6.6% |
| i | 607492 | 6.6% |
| n | 495287 | 5.4% |
| p | 488481 | 5.3% |
| c | 488481 | 5.3% |
| Other values (12) | 1596992 |
Intersection
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 6373 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 32624 |
| Missing (%) | 5.3% |
| Memory size | 45.2 MiB |
| MARKET ST \ POWELL ST | 3545 |
|---|---|
| POWELL ST \ OFARRELL ST | 2974 |
| BOARDMAN PL \ BRYANT ST | 2903 |
| EDDY ST \ JONES ST | 2575 |
| 20TH AVE \ WINSTON DR | 2414 |
| Other values (6368) |
Length
| Max length | 84 |
|---|---|
| Median length | 60 |
| Mean length | 23.218145 |
| Min length | 12 |
Characters and Unicode
| Total characters | 13426380 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 33 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | EXCELSIOR AVE \ MISSION ST |
|---|---|
| 2nd row | NORTH POINT ST \ LARKIN ST |
| 3rd row | FELL ST \ ASHBURY ST |
| 4th row | FILLMORE ST \ SACRAMENTO ST |
| 5th row | JERROLD AVE \ PHELPS ST |
Common Values
| Value | Count | Frequency (%) |
| MARKET ST \ POWELL ST | 3545 | 0.6% |
| POWELL ST \ OFARRELL ST | 2974 | 0.5% |
| BOARDMAN PL \ BRYANT ST | 2903 | 0.5% |
| EDDY ST \ JONES ST | 2575 | 0.4% |
| 20TH AVE \ WINSTON DR | 2414 | 0.4% |
| 16TH ST \ MISSION ST | 2391 | 0.4% |
| 08TH ST \ GROVE ST \ HYDE ST \ MARKET ST | 2333 | 0.4% |
| 04TH ST \ LONG BRIDGE ST | 2113 | 0.3% |
| HYDE ST \ TURK ST | 2002 | 0.3% |
| UNITED NATIONS PLZ \ LEAVENWORTH ST | 1999 | 0.3% |
| Other values (6363) | 553022 | |
| (Missing) | 32624 | 5.3% |
Length
| Value | Count | Frequency (%) |
| st | 885804 | |
| 621448 | ||
| ave | 190737 | 6.1% |
| mission | 29678 | 1.0% |
| dr | 24894 | 0.8% |
| blvd | 23201 | 0.7% |
| market | 22015 | 0.7% |
| eddy | 15317 | 0.5% |
| way | 15261 | 0.5% |
| geary | 14715 | 0.5% |
| Other values (2051) | 1275809 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2540608 | ||
| T | 1435301 | |
| S | 1328829 | 9.9% |
| A | 930273 | 6.9% |
| E | 861040 | 6.4% |
| \ | 621448 | 4.6% |
| N | 613583 | 4.6% |
| R | 589051 | 4.4% |
| O | 583456 | 4.3% |
| L | 523076 | 3.9% |
| Other values (29) | 3399715 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9901346 | |
| Space Separator | 2540608 | 18.9% |
| Other Punctuation | 621448 | 4.6% |
| Decimal Number | 362541 | 2.7% |
| Dash Punctuation | 437 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1435301 | |
| S | 1328829 | |
| A | 930273 | |
| E | 861040 | 8.7% |
| N | 613583 | 6.2% |
| R | 589051 | 5.9% |
| O | 583456 | 5.9% |
| L | 523076 | 5.3% |
| I | 401337 | 4.1% |
| H | 345321 | 3.5% |
| Other values (16) | 2290079 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 77633 | |
| 1 | 67359 | |
| 2 | 59822 | |
| 4 | 32315 | |
| 6 | 27808 | 7.7% |
| 3 | 26658 | 7.4% |
| 8 | 19609 | 5.4% |
| 9 | 17573 | 4.8% |
| 7 | 17160 | 4.7% |
| 5 | 16604 | 4.6% |
Space Separator
| Value | Count | Frequency (%) |
| 2540608 |
Other Punctuation
| Value | Count | Frequency (%) |
| \ | 621448 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 437 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9901346 | |
| Common | 3525034 | 26.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 1435301 | |
| S | 1328829 | |
| A | 930273 | |
| E | 861040 | 8.7% |
| N | 613583 | 6.2% |
| R | 589051 | 5.9% |
| O | 583456 | 5.9% |
| L | 523076 | 5.3% |
| I | 401337 | 4.1% |
| H | 345321 | 3.5% |
| Other values (16) | 2290079 |
Common
| Value | Count | Frequency (%) |
| 2540608 | ||
| \ | 621448 | 17.6% |
| 0 | 77633 | 2.2% |
| 1 | 67359 | 1.9% |
| 2 | 59822 | 1.7% |
| 4 | 32315 | 0.9% |
| 6 | 27808 | 0.8% |
| 3 | 26658 | 0.8% |
| 8 | 19609 | 0.6% |
| 9 | 17573 | 0.5% |
| Other values (3) | 34201 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13426380 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2540608 | ||
| T | 1435301 | |
| S | 1328829 | 9.9% |
| A | 930273 | 6.9% |
| E | 861040 | 6.4% |
| \ | 621448 | 4.6% |
| N | 613583 | 4.6% |
| R | 589051 | 4.4% |
| O | 583456 | 4.3% |
| L | 523076 | 3.9% |
| Other values (29) | 3399715 |
CNN
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 6460 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 32624 |
| Missing (%) | 5.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25330875 |
| Minimum | 20013000 |
|---|---|
| Maximum | 54203000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 20013000 |
|---|---|
| 5-th percentile | 20618000 |
| Q1 | 23967000 |
| median | 24924000 |
| Q3 | 26469000 |
| 95-th percentile | 33229000 |
| Maximum | 54203000 |
| Range | 34190000 |
| Interquartile range (IQR) | 2502000 |
Descriptive statistics
| Standard deviation | 3095514.3 |
|---|---|
| Coefficient of variation (CV) | 0.12220321 |
| Kurtosis | 5.5925796 |
| Mean | 25330875 |
| Median Absolute Deviation (MAD) | 1120000 |
| Skewness | 1.4918742 |
| Sum | 1.4648111 × 1013 |
| Variance | 9.5822087 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34016000 | 3545 | 0.6% |
| 24904000 | 2974 | 0.5% |
| 23914000 | 2903 | 0.5% |
| 24929000 | 2575 | 0.4% |
| 33719000 | 2414 | 0.4% |
| 24170000 | 2391 | 0.4% |
| 24429000 | 2333 | 0.4% |
| 34168000 | 2113 | 0.3% |
| 24933000 | 2002 | 0.3% |
| 30044000 | 1999 | 0.3% |
| Other values (6450) | 553022 | |
| (Missing) | 32624 | 5.3% |
| Value | Count | Frequency (%) |
| 20013000 | 139 | |
| 20034000 | 84 | < 0.1% |
| 20039000 | 76 | < 0.1% |
| 20041000 | 203 | |
| 20044000 | 108 | < 0.1% |
| 20046000 | 172 | |
| 20056000 | 170 | |
| 20058000 | 234 | |
| 20060000 | 326 | |
| 20061000 | 87 | < 0.1% |
| Value | Count | Frequency (%) |
| 54203000 | 1 | < 0.1% |
| 54122000 | 5 | < 0.1% |
| 54004000 | 4 | < 0.1% |
| 51555000 | 3 | < 0.1% |
| 51545000 | 1 | < 0.1% |
| 51541000 | 20 | |
| 51535000 | 3 | < 0.1% |
| 51527000 | 12 | < 0.1% |
| 51484000 | 3 | < 0.1% |
| 51483000 | 41 |
Police District
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 37.7 MiB |
| Central | |
|---|---|
| Northern | |
| Mission | |
| Southern | |
| Tenderloin | |
| Other values (6) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 7.674363 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4688230 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Southern |
|---|---|
| 2nd row | Out of SF |
| 3rd row | Central |
| 4th row | Out of SF |
| 5th row | Out of SF |
Common Values
| Value | Count | Frequency (%) |
| Central | 91668 | |
| Northern | 82733 | |
| Mission | 77568 | |
| Southern | 74473 | |
| Tenderloin | 58467 | |
| Bayview | 53473 | |
| Ingleside | 45495 | |
| Taraval | 42760 | |
| Richmond | 38028 | |
| Park | 28424 | 4.7% |
Length
| Value | Count | Frequency (%) |
| central | 91668 | |
| northern | 82733 | |
| mission | 77568 | |
| southern | 74473 | |
| tenderloin | 58467 | |
| bayview | 53473 | |
| ingleside | 45495 | |
| taraval | 42760 | |
| richmond | 38028 | |
| park | 28424 | 4.4% |
| Other values (3) | 53418 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 526899 | |
| e | 510271 | 10.9% |
| r | 461258 | 9.8% |
| i | 350599 | 7.5% |
| o | 349075 | 7.4% |
| a | 301845 | 6.4% |
| t | 266680 | 5.7% |
| l | 238390 | 5.1% |
| s | 200631 | 4.3% |
| h | 195234 | 4.2% |
| Other values (22) | 1287348 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4006111 | |
| Uppercase Letter | 646507 | 13.8% |
| Space Separator | 35612 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 526899 | |
| e | 510271 | |
| r | 461258 | |
| i | 350599 | |
| o | 349075 | |
| a | 301845 | |
| t | 266680 | |
| l | 238390 | 6.0% |
| s | 200631 | 5.0% |
| h | 195234 | 4.9% |
| Other values (10) | 605229 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 101227 | |
| S | 92279 | |
| C | 91668 | |
| N | 82733 | |
| M | 77568 | |
| B | 53473 | |
| I | 45495 | |
| R | 38028 | 5.9% |
| P | 28424 | 4.4% |
| O | 17806 | 2.8% |
Space Separator
| Value | Count | Frequency (%) |
| 35612 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4652618 | |
| Common | 35612 | 0.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 526899 | |
| e | 510271 | |
| r | 461258 | 9.9% |
| i | 350599 | 7.5% |
| o | 349075 | 7.5% |
| a | 301845 | 6.5% |
| t | 266680 | 5.7% |
| l | 238390 | 5.1% |
| s | 200631 | 4.3% |
| h | 195234 | 4.2% |
| Other values (21) | 1251736 |
Common
| Value | Count | Frequency (%) |
| 35612 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4688230 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 526899 | |
| e | 510271 | 10.9% |
| r | 461258 | 9.8% |
| i | 350599 | 7.5% |
| o | 349075 | 7.4% |
| a | 301845 | 6.4% |
| t | 266680 | 5.7% |
| l | 238390 | 5.1% |
| s | 200631 | 4.3% |
| h | 195234 | 4.2% |
| Other values (22) | 1287348 |
Analysis Neighborhood
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 41 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 32738 |
| Missing (%) | 5.4% |
| Memory size | 40.2 MiB |
| Mission | |
|---|---|
| Tenderloin | |
| Financial District/South Beach | |
| South of Market | |
| Bayview Hunters Point | |
| Other values (36) |
Length
| Max length | 30 |
|---|---|
| Median length | 18 |
| Mean length | 14.070759 |
| Min length | 6 |
Characters and Unicode
| Total characters | 8135108 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Excelsior |
|---|---|
| 2nd row | Russian Hill |
| 3rd row | Lone Mountain/USF |
| 4th row | Pacific Heights |
| 5th row | Bayview Hunters Point |
Common Values
| Value | Count | Frequency (%) |
| Mission | 62476 | 10.2% |
| Tenderloin | 59256 | 9.7% |
| Financial District/South Beach | 48559 | 7.9% |
| South of Market | 47079 | 7.7% |
| Bayview Hunters Point | 37497 | 6.1% |
| North Beach | 19302 | 3.2% |
| Western Addition | 18639 | 3.1% |
| Castro/Upper Market | 17481 | 2.9% |
| Sunset/Parkside | 17211 | 2.8% |
| Nob Hill | 16667 | 2.7% |
| Other values (31) | 233990 | |
| (Missing) | 32738 | 5.4% |
Length
| Value | Count | Frequency (%) |
| mission | 81260 | 7.3% |
| beach | 67861 | 6.1% |
| market | 64560 | 5.8% |
| tenderloin | 59256 | 5.3% |
| of | 58817 | 5.3% |
| financial | 48559 | 4.4% |
| district/south | 48559 | 4.4% |
| south | 47079 | 4.2% |
| hill | 40529 | 3.6% |
| point | 37497 | 3.4% |
| Other values (46) | 560815 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 778052 | 9.6% |
| e | 676283 | 8.3% |
| n | 628962 | 7.7% |
| t | 551608 | 6.8% |
| 536635 | 6.6% | |
| o | 531311 | 6.5% |
| a | 527962 | 6.5% |
| s | 469918 | 5.8% |
| r | 443813 | 5.5% |
| l | 292033 | 3.6% |
| Other values (36) | 2698531 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6322067 | |
| Uppercase Letter | 1173594 | 14.4% |
| Space Separator | 536635 | 6.6% |
| Other Punctuation | 102812 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 778052 | |
| e | 676283 | |
| n | 628962 | |
| t | 551608 | |
| o | 531311 | |
| a | 527962 | |
| s | 469918 | |
| r | 443813 | 7.0% |
| l | 292033 | 4.6% |
| h | 266229 | 4.2% |
| Other values (13) | 1155896 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 176132 | |
| H | 128138 | |
| S | 128087 | |
| B | 126589 | |
| P | 112057 | |
| T | 76111 | 6.5% |
| F | 55618 | 4.7% |
| D | 48559 | 4.1% |
| N | 42184 | 3.6% |
| R | 35191 | 3.0% |
| Other values (11) | 244928 |
Space Separator
| Value | Count | Frequency (%) |
| 536635 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 102812 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7495661 | |
| Common | 639447 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 778052 | 10.4% |
| e | 676283 | 9.0% |
| n | 628962 | 8.4% |
| t | 551608 | 7.4% |
| o | 531311 | 7.1% |
| a | 527962 | 7.0% |
| s | 469918 | 6.3% |
| r | 443813 | 5.9% |
| l | 292033 | 3.9% |
| h | 266229 | 3.6% |
| Other values (34) | 2329490 |
Common
| Value | Count | Frequency (%) |
| 536635 | ||
| / | 102812 | 16.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8135108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 778052 | 9.6% |
| e | 676283 | 8.3% |
| n | 628962 | 7.7% |
| t | 551608 | 6.8% |
| 536635 | 6.6% | |
| o | 531311 | 6.5% |
| a | 527962 | 6.5% |
| s | 469918 | 5.8% |
| r | 443813 | 5.5% |
| l | 292033 | 3.6% |
| Other values (36) | 2698531 |
Supervisor District
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 32624 |
| Missing (%) | 5.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.9612223 |
| Minimum | 1 |
|---|---|
| Maximum | 11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 6 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.8032855 |
|---|---|
| Coefficient of variation (CV) | 0.47025347 |
| Kurtosis | -1.0250774 |
| Mean | 5.9612223 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.026484574 |
| Sum | 3447202 |
| Variance | 7.8584095 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 135319 | |
| 3 | 86050 | |
| 10 | 62768 | |
| 5 | 58887 | |
| 9 | 57260 | |
| 2 | 44144 | 7.2% |
| 8 | 44095 | 7.2% |
| 1 | 27563 | 4.5% |
| 7 | 24609 | 4.0% |
| 11 | 21215 | 3.5% |
| (Missing) | 32624 | 5.3% |
| Value | Count | Frequency (%) |
| 1 | 27563 | 4.5% |
| 2 | 44144 | 7.2% |
| 3 | 86050 | |
| 4 | 16361 | 2.7% |
| 5 | 58887 | |
| 6 | 135319 | |
| 7 | 24609 | 4.0% |
| 8 | 44095 | 7.2% |
| 9 | 57260 | |
| 10 | 62768 |
| Value | Count | Frequency (%) |
| 11 | 21215 | 3.5% |
| 10 | 62768 | |
| 9 | 57260 | |
| 8 | 44095 | 7.2% |
| 7 | 24609 | 4.0% |
| 6 | 135319 | |
| 5 | 58887 | |
| 4 | 16361 | 2.7% |
| 3 | 86050 | |
| 2 | 44144 | 7.2% |
Latitude
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 6458 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 32624 |
| Missing (%) | 5.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.769339 |
| Minimum | 37.707988 |
|---|---|
| Maximum | 37.829991 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 37.707988 |
|---|---|
| 5-th percentile | 37.720724 |
| Q1 | 37.755295 |
| median | 37.775894 |
| Q3 | 37.785893 |
| 95-th percentile | 37.802791 |
| Maximum | 37.829991 |
| Range | 0.12200249 |
| Interquartile range (IQR) | 0.03059812 |
Descriptive statistics
| Standard deviation | 0.024366596 |
|---|---|
| Coefficient of variation (CV) | 0.00064514224 |
| Kurtosis | -0.296199 |
| Mean | 37.769339 |
| Median Absolute Deviation (MAD) | 0.01301542 |
| Skewness | -0.68109821 |
| Sum | 21840913 |
| Variance | 0.00059373098 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37.78456014 | 3545 | 0.6% |
| 37.78640961 | 2974 | 0.5% |
| 37.77516081 | 2903 | 0.5% |
| 37.78393258 | 2575 | 0.4% |
| 37.72694991 | 2414 | 0.4% |
| 37.76505134 | 2391 | 0.4% |
| 37.77871943 | 2333 | 0.4% |
| 37.77346692 | 2113 | 0.3% |
| 37.78258503 | 2002 | 0.3% |
| 37.77999174 | 1999 | 0.3% |
| Other values (6448) | 553022 | |
| (Missing) | 32624 | 5.3% |
| Value | Count | Frequency (%) |
| 37.70798826 | 10 | < 0.1% |
| 37.70802018 | 62 | |
| 37.70805761 | 31 | < 0.1% |
| 37.7082148 | 6 | < 0.1% |
| 37.70825596 | 62 | |
| 37.70830771 | 1 | < 0.1% |
| 37.70831127 | 108 | |
| 37.70832812 | 17 | < 0.1% |
| 37.70835434 | 22 | < 0.1% |
| 37.70844468 | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 37.82999075 | 127 | |
| 37.82979158 | 31 | < 0.1% |
| 37.8296623 | 53 | |
| 37.82961662 | 62 | |
| 37.82954858 | 129 | |
| 37.82944921 | 57 | |
| 37.82911002 | 40 | < 0.1% |
| 37.82908934 | 77 | |
| 37.82834123 | 25 | < 0.1% |
| 37.82788815 | 55 |
Longitude
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 6436 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 32624 |
| Missing (%) | 5.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -122.42392 |
| Minimum | -122.51129 |
|---|---|
| Maximum | -122.36374 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 578271 |
| Negative (%) | 94.7% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | -122.51129 |
|---|---|
| 5-th percentile | -122.48045 |
| Q1 | -122.4344 |
| median | -122.41771 |
| Q3 | -122.40729 |
| 95-th percentile | -122.3911 |
| Maximum | -122.36374 |
| Range | 0.1475521 |
| Interquartile range (IQR) | 0.0271135 |
Descriptive statistics
| Standard deviation | 0.026350605 |
|---|---|
| Coefficient of variation (CV) | -0.00021524066 |
| Kurtosis | 1.1965235 |
| Mean | -122.42392 |
| Median Absolute Deviation (MAD) | 0.0125638 |
| Skewness | -1.1518327 |
| Sum | -70794201 |
| Variance | 0.00069435436 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -122.407337 | 3545 | 0.6% |
| -122.4080362 | 2974 | 0.5% |
| -122.4036355 | 2903 | 0.5% |
| -122.4125953 | 2575 | 0.4% |
| -122.4760395 | 2414 | 0.4% |
| -122.419669 | 2391 | 0.4% |
| -122.4147412 | 2333 | 0.4% |
| -122.3914343 | 2113 | 0.3% |
| -122.4156939 | 2002 | 0.3% |
| -122.4134874 | 1999 | 0.3% |
| Other values (6426) | 553022 | |
| (Missing) | 32624 | 5.3% |
| Value | Count | Frequency (%) |
| -122.5112949 | 1363 | |
| -122.5103413 | 20 | < 0.1% |
| -122.5101688 | 99 | < 0.1% |
| -122.510037 | 70 | < 0.1% |
| -122.5098948 | 1137 | |
| -122.5098792 | 177 | < 0.1% |
| -122.5096222 | 33 | < 0.1% |
| -122.5094329 | 188 | < 0.1% |
| -122.5094022 | 315 | 0.1% |
| -122.5093683 | 21 | < 0.1% |
| Value | Count | Frequency (%) |
| -122.3637428 | 164 | < 0.1% |
| -122.36843 | 52 | < 0.1% |
| -122.3690371 | 46 | < 0.1% |
| -122.3691332 | 3 | < 0.1% |
| -122.3695409 | 11 | < 0.1% |
| -122.3696925 | 32 | < 0.1% |
| -122.3703524 | 84 | < 0.1% |
| -122.3707119 | 54 | < 0.1% |
| -122.3708198 | 46 | < 0.1% |
| -122.3712459 | 448 |
Point
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 6460 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 32624 |
| Missing (%) | 5.3% |
| Memory size | 57.3 MiB |
| POINT (-122.40733704162238 37.784560141211806) | 3545 |
|---|---|
| POINT (-122.40803623744476 37.78640961281089) | 2974 |
| POINT (-122.40363551943442 37.7751608100771) | 2903 |
| POINT (-122.41259527758581 37.7839325760642) | 2575 |
| POINT (-122.47603947349434 37.72694991292525) | 2414 |
| Other values (6455) |
Length
| Max length | 46 |
|---|---|
| Median length | 45 |
| Mean length | 45.043824 |
| Min length | 41 |
Characters and Unicode
| Total characters | 26047537 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 35 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | POINT (-122.43362359230944 37.72623624315635) |
|---|---|
| 2nd row | POINT (-122.42200682265661 37.80549664761133) |
| 3rd row | POINT (-122.44749724585684 37.77279045274103) |
| 4th row | POINT (-122.43402709034117 37.78983697125977) |
| 5th row | POINT (-122.39126842523832 37.73985319897475) |
Common Values
| Value | Count | Frequency (%) |
| POINT (-122.40733704162238 37.784560141211806) | 3545 | 0.6% |
| POINT (-122.40803623744476 37.78640961281089) | 2974 | 0.5% |
| POINT (-122.40363551943442 37.7751608100771) | 2903 | 0.5% |
| POINT (-122.41259527758581 37.7839325760642) | 2575 | 0.4% |
| POINT (-122.47603947349434 37.72694991292525) | 2414 | 0.4% |
| POINT (-122.41966897380142 37.76505133632968) | 2391 | 0.4% |
| POINT (-122.4147412230519 37.77871942789032) | 2333 | 0.4% |
| POINT (-122.39143433652146 37.773466920607476) | 2113 | 0.3% |
| POINT (-122.41569387441227 37.78258503232177) | 2002 | 0.3% |
| POINT (-122.41348740024354 37.77999173926721) | 1999 | 0.3% |
| Other values (6450) | 553022 | |
| (Missing) | 32624 | 5.3% |
Length
| Value | Count | Frequency (%) |
| point | 578271 | |
| 37.784560141211806 | 3545 | 0.2% |
| 122.40733704162238 | 3545 | 0.2% |
| 122.40803623744476 | 2974 | 0.2% |
| 37.78640961281089 | 2974 | 0.2% |
| 122.40363551943442 | 2903 | 0.2% |
| 37.7751608100771 | 2903 | 0.2% |
| 122.41259527758581 | 2575 | 0.1% |
| 37.7839325760642 | 2575 | 0.1% |
| 122.47603947349434 | 2414 | 0.1% |
| Other values (12911) | 1130134 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2642702 | 10.1% |
| 7 | 2621506 | 10.1% |
| 3 | 2128195 | 8.2% |
| 1 | 2120749 | 8.1% |
| 4 | 2023871 | 7.8% |
| 8 | 1609631 | 6.2% |
| 6 | 1522751 | 5.8% |
| 5 | 1511264 | 5.8% |
| 9 | 1481118 | 5.7% |
| 0 | 1446498 | 5.6% |
| Other values (10) | 6939252 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 19108285 | |
| Uppercase Letter | 2891355 | 11.1% |
| Other Punctuation | 1156542 | 4.4% |
| Space Separator | 1156542 | 4.4% |
| Dash Punctuation | 578271 | 2.2% |
| Open Punctuation | 578271 | 2.2% |
| Close Punctuation | 578271 | 2.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2642702 | |
| 7 | 2621506 | |
| 3 | 2128195 | |
| 1 | 2120749 | |
| 4 | 2023871 | |
| 8 | 1609631 | |
| 6 | 1522751 | |
| 5 | 1511264 | |
| 9 | 1481118 | |
| 0 | 1446498 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 578271 | |
| T | 578271 | |
| N | 578271 | |
| I | 578271 | |
| P | 578271 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1156542 |
Space Separator
| Value | Count | Frequency (%) |
| 1156542 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 578271 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 578271 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 578271 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23156182 | |
| Latin | 2891355 | 11.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2642702 | |
| 7 | 2621506 | |
| 3 | 2128195 | |
| 1 | 2120749 | |
| 4 | 2023871 | |
| 8 | 1609631 | 7.0% |
| 6 | 1522751 | 6.6% |
| 5 | 1511264 | 6.5% |
| 9 | 1481118 | 6.4% |
| 0 | 1446498 | 6.2% |
| Other values (5) | 4047897 |
Latin
| Value | Count | Frequency (%) |
| O | 578271 | |
| T | 578271 | |
| N | 578271 | |
| I | 578271 | |
| P | 578271 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26047537 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2642702 | 10.1% |
| 7 | 2621506 | 10.1% |
| 3 | 2128195 | 8.2% |
| 1 | 2120749 | 8.1% |
| 4 | 2023871 | 7.8% |
| 8 | 1609631 | 6.2% |
| 6 | 1522751 | 5.8% |
| 5 | 1511264 | 5.8% |
| 9 | 1481118 | 5.7% |
| 0 | 1446498 | 5.6% |
| Other values (10) | 6939252 |
Neighborhoods
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 116 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 45029 |
| Missing (%) | 7.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.944906 |
| Minimum | 1 |
|---|---|
| Maximum | 117 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 23 |
| median | 48 |
| Q3 | 86 |
| 95-th percentile | 107 |
| Maximum | 117 |
| Range | 116 |
| Interquartile range (IQR) | 63 |
Descriptive statistics
| Standard deviation | 32.628641 |
|---|---|
| Coefficient of variation (CV) | 0.61627537 |
| Kurtosis | -1.2351318 |
| Mean | 52.944906 |
| Median Absolute Deviation (MAD) | 28 |
| Skewness | 0.39822005 |
| Sum | 29959722 |
| Variance | 1064.6282 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 32 | 56186 | 9.2% |
| 53 | 46820 | 7.7% |
| 20 | 36156 | 5.9% |
| 19 | 21292 | 3.5% |
| 21 | 18392 | 3.0% |
| 86 | 12860 | 2.1% |
| 99 | 12502 | 2.0% |
| 50 | 11173 | 1.8% |
| 54 | 10919 | 1.8% |
| 39 | 10832 | 1.8% |
| Other values (106) | 328734 | |
| (Missing) | 45029 | 7.4% |
| Value | Count | Frequency (%) |
| 1 | 620 | 0.1% |
| 2 | 320 | 0.1% |
| 3 | 490 | 0.1% |
| 4 | 772 | 0.1% |
| 5 | 10399 | |
| 6 | 716 | 0.1% |
| 7 | 177 | < 0.1% |
| 8 | 10261 | |
| 9 | 4554 | |
| 10 | 1209 | 0.2% |
| Value | Count | Frequency (%) |
| 117 | 347 | 0.1% |
| 116 | 395 | 0.1% |
| 115 | 2251 | 0.4% |
| 114 | 806 | 0.1% |
| 113 | 749 | 0.1% |
| 112 | 2691 | 0.4% |
| 111 | 348 | 0.1% |
| 110 | 706 | 0.1% |
| 109 | 4764 | |
| 108 | 10421 |
ESNCAG - Boundary File
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 604133 |
| Missing (%) | 98.9% |
| Memory size | 23.4 MiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 20286 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 6762 | 1.1% |
| (Missing) | 604133 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 6762 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6762 | |
| . | 6762 | |
| 0 | 6762 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13524 | |
| Other Punctuation | 6762 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6762 | |
| 0 | 6762 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6762 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20286 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 6762 | |
| . | 6762 | |
| 0 | 6762 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20286 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 6762 | |
| . | 6762 | |
| 0 | 6762 |
Central Market/Tenderloin Boundary Polygon - Updated
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 532645 |
| Missing (%) | 87.2% |
| Memory size | 24.8 MiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 234750 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 78250 | 12.8% |
| (Missing) | 532645 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 78250 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 78250 | |
| . | 78250 | |
| 0 | 78250 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 156500 | |
| Other Punctuation | 78250 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 78250 | |
| 0 | 78250 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 78250 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 234750 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 78250 | |
| . | 78250 | |
| 0 | 78250 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 234750 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 78250 | |
| . | 78250 | |
| 0 | 78250 |
Civic Center Harm Reduction Project Boundary
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 532929 |
| Missing (%) | 87.2% |
| Memory size | 24.8 MiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 233898 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 77966 | 12.8% |
| (Missing) | 532929 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 77966 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 77966 | |
| . | 77966 | |
| 0 | 77966 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 155932 | |
| Other Punctuation | 77966 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 77966 | |
| 0 | 77966 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 77966 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 233898 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 77966 | |
| . | 77966 | |
| 0 | 77966 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 233898 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 77966 | |
| . | 77966 | |
| 0 | 77966 |
HSOC Zones as of 2018-06-05
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 482105 |
| Missing (%) | 78.9% |
| Memory size | 25.8 MiB |
| 1.0 | |
|---|---|
| 3.0 | |
| 5.0 | |
| 4.0 | 2642 |
| 2.0 | 2422 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 386370 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 1.0 |
| 4th row | 3.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 57713 | 9.4% |
| 3.0 | 50072 | 8.2% |
| 5.0 | 15941 | 2.6% |
| 4.0 | 2642 | 0.4% |
| 2.0 | 2422 | 0.4% |
| (Missing) | 482105 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 57713 | |
| 3.0 | 50072 | |
| 5.0 | 15941 | 12.4% |
| 4.0 | 2642 | 2.1% |
| 2.0 | 2422 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 128790 | |
| 0 | 128790 | |
| 1 | 57713 | |
| 3 | 50072 | 13.0% |
| 5 | 15941 | 4.1% |
| 4 | 2642 | 0.7% |
| 2 | 2422 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 257580 | |
| Other Punctuation | 128790 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 128790 | |
| 1 | 57713 | |
| 3 | 50072 | 19.4% |
| 5 | 15941 | 6.2% |
| 4 | 2642 | 1.0% |
| 2 | 2422 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 128790 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 386370 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 128790 | |
| 0 | 128790 | |
| 1 | 57713 | |
| 3 | 50072 | 13.0% |
| 5 | 15941 | 4.1% |
| 4 | 2642 | 0.7% |
| 2 | 2422 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 386370 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 128790 | |
| 0 | 128790 | |
| 1 | 57713 | |
| 3 | 50072 | 13.0% |
| 5 | 15941 | 4.1% |
| 4 | 2642 | 0.7% |
| 2 | 2422 | 0.6% |
Invest In Neighborhoods (IIN) Areas
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 610895 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 4.7 MiB |
Current Supervisor Districts
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 32728 |
| Missing (%) | 5.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.6868673 |
| Minimum | 1 |
|---|---|
| Maximum | 11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 11 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.3317477 |
|---|---|
| Coefficient of variation (CV) | 0.49825241 |
| Kurtosis | -1.5116889 |
| Mean | 6.6868673 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.20502285 |
| Sum | 3866126 |
| Variance | 11.100543 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 135319 | |
| 3 | 86050 | |
| 9 | 62705 | |
| 11 | 58887 | |
| 2 | 57260 | |
| 6 | 44144 | 7.2% |
| 5 | 44095 | 7.2% |
| 4 | 27563 | 4.5% |
| 8 | 24609 | 4.0% |
| 1 | 21174 | 3.5% |
| (Missing) | 32728 | 5.4% |
| Value | Count | Frequency (%) |
| 1 | 21174 | 3.5% |
| 2 | 57260 | |
| 3 | 86050 | |
| 4 | 27563 | 4.5% |
| 5 | 44095 | 7.2% |
| 6 | 44144 | 7.2% |
| 7 | 16361 | 2.7% |
| 8 | 24609 | 4.0% |
| 9 | 62705 | |
| 10 | 135319 |
| Value | Count | Frequency (%) |
| 11 | 58887 | |
| 10 | 135319 | |
| 9 | 62705 | |
| 8 | 24609 | 4.0% |
| 7 | 16361 | 2.7% |
| 6 | 44144 | 7.2% |
| 5 | 44095 | 7.2% |
| 4 | 27563 | 4.5% |
| 3 | 86050 | |
| 2 | 57260 |
Current Police Districts
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 33332 |
| Missing (%) | 5.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.9035188 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.7440637 |
|---|---|
| Coefficient of variation (CV) | 0.55961113 |
| Kurtosis | -0.92846815 |
| Mean | 4.9035188 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.32412312 |
| Sum | 2832091 |
| Variance | 7.5298854 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 88142 | |
| 4 | 82202 | |
| 3 | 76155 | |
| 1 | 74353 | |
| 5 | 55338 | |
| 2 | 53542 | |
| 9 | 44047 | |
| 10 | 43889 | |
| 8 | 33261 | 5.4% |
| 7 | 26634 | 4.4% |
| (Missing) | 33332 | 5.5% |
| Value | Count | Frequency (%) |
| 1 | 74353 | |
| 2 | 53542 | |
| 3 | 76155 | |
| 4 | 82202 | |
| 5 | 55338 | |
| 6 | 88142 | |
| 7 | 26634 | 4.4% |
| 8 | 33261 | 5.4% |
| 9 | 44047 | |
| 10 | 43889 |
| Value | Count | Frequency (%) |
| 10 | 43889 | |
| 9 | 44047 | |
| 8 | 33261 | 5.4% |
| 7 | 26634 | 4.4% |
| 6 | 88142 | |
| 5 | 55338 | |
| 4 | 82202 | |
| 3 | 76155 | |
| 2 | 53542 | |
| 1 | 74353 |
| Incident Year | Row ID | Incident ID | Incident Number | CAD Number | Incident Code | CNN | Supervisor District | Latitude | Longitude | Neighborhoods | Current Supervisor Districts | Current Police Districts | Incident Day of Week | Report Type Code | Report Type Description | Incident Category | Incident Subcategory | Resolution | Police District | Analysis Neighborhood | HSOC Zones as of 2018-06-05 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Incident Year | 1.000 | 0.973 | 0.973 | 0.969 | 0.972 | -0.014 | 0.005 | 0.005 | -0.012 | -0.016 | 0.009 | 0.011 | 0.004 | 0.005 | 0.050 | 0.042 | 0.067 | 0.076 | 0.046 | 0.030 | 0.042 | 0.029 |
| Row ID | 0.973 | 1.000 | 1.000 | 0.981 | 0.998 | -0.014 | 0.007 | 0.002 | -0.011 | -0.016 | 0.008 | 0.011 | 0.005 | 0.006 | 0.053 | 0.045 | 0.053 | 0.061 | 0.049 | 0.027 | 0.037 | 0.033 |
| Incident ID | 0.973 | 1.000 | 1.000 | 0.981 | 0.998 | -0.014 | 0.007 | 0.002 | -0.011 | -0.016 | 0.008 | 0.011 | 0.005 | 0.006 | 0.053 | 0.045 | 0.053 | 0.061 | 0.049 | 0.027 | 0.037 | 0.033 |
| Incident Number | 0.969 | 0.981 | 0.981 | 1.000 | 0.997 | -0.040 | 0.020 | -0.012 | 0.003 | -0.025 | 0.010 | 0.012 | 0.008 | 0.002 | 0.050 | 0.042 | 0.084 | 0.114 | 0.046 | 0.026 | 0.033 | 0.013 |
| CAD Number | 0.972 | 0.998 | 0.998 | 0.997 | 1.000 | -0.023 | 0.002 | 0.009 | -0.013 | -0.010 | 0.008 | 0.016 | -0.004 | 0.004 | 0.063 | 0.063 | 0.071 | 0.077 | 0.151 | 0.023 | 0.030 | 0.018 |
| Incident Code | -0.014 | -0.014 | -0.014 | -0.040 | -0.023 | 1.000 | -0.058 | 0.071 | -0.070 | 0.023 | -0.011 | 0.015 | -0.014 | 0.019 | 0.163 | 0.207 | 0.824 | 0.824 | 0.243 | 0.092 | 0.092 | 0.087 |
| CNN | 0.005 | 0.007 | 0.007 | 0.020 | 0.002 | -0.058 | 1.000 | -0.624 | 0.482 | -0.362 | -0.241 | 0.147 | 0.184 | 0.008 | 0.073 | 0.069 | 0.086 | 0.107 | 0.044 | 0.490 | 0.653 | 0.163 |
| Supervisor District | 0.005 | 0.002 | 0.002 | -0.012 | 0.009 | 0.071 | -0.624 | 1.000 | -0.785 | 0.275 | 0.291 | -0.055 | -0.290 | 0.012 | 0.088 | 0.096 | 0.117 | 0.131 | 0.093 | 0.659 | 0.888 | 0.817 |
| Latitude | -0.012 | -0.011 | -0.011 | 0.003 | -0.013 | -0.070 | 0.482 | -0.785 | 1.000 | 0.149 | -0.192 | 0.094 | -0.045 | 0.011 | 0.084 | 0.083 | 0.087 | 0.107 | 0.060 | 0.472 | 0.762 | 0.603 |
| Longitude | -0.016 | -0.016 | -0.016 | -0.025 | -0.010 | 0.023 | -0.362 | 0.275 | 0.149 | 1.000 | 0.134 | 0.091 | -0.625 | 0.009 | 0.049 | 0.057 | 0.087 | 0.095 | 0.084 | 0.472 | 0.674 | 0.739 |
| Neighborhoods | 0.009 | 0.008 | 0.008 | 0.010 | 0.008 | -0.011 | -0.241 | 0.291 | -0.192 | 0.134 | 1.000 | -0.201 | 0.006 | 0.008 | 0.078 | 0.076 | 0.092 | 0.105 | 0.080 | 0.601 | 0.755 | 0.763 |
| Current Supervisor Districts | 0.011 | 0.011 | 0.011 | 0.012 | 0.016 | 0.015 | 0.147 | -0.055 | 0.094 | 0.091 | -0.201 | 1.000 | -0.298 | 0.013 | 0.087 | 0.097 | 0.116 | 0.131 | 0.092 | 0.669 | 0.887 | 0.817 |
| Current Police Districts | 0.004 | 0.005 | 0.005 | 0.008 | -0.004 | -0.014 | 0.184 | -0.290 | -0.045 | -0.625 | 0.006 | -0.298 | 1.000 | 0.013 | 0.095 | 0.102 | 0.126 | 0.135 | 0.101 | 0.942 | 0.865 | 0.629 |
| Incident Day of Week | 0.005 | 0.006 | 0.006 | 0.002 | 0.004 | 0.019 | 0.008 | 0.012 | 0.011 | 0.009 | 0.008 | 0.013 | 0.013 | 1.000 | 0.024 | 0.027 | 0.032 | 0.034 | 0.021 | 0.015 | 0.018 | 0.019 |
| Report Type Code | 0.050 | 0.053 | 0.053 | 0.050 | 0.063 | 0.163 | 0.073 | 0.088 | 0.084 | 0.049 | 0.078 | 0.087 | 0.095 | 0.024 | 1.000 | 1.000 | 0.701 | 0.753 | 0.137 | 0.201 | 0.105 | 0.045 |
| Report Type Description | 0.042 | 0.045 | 0.045 | 0.042 | 0.063 | 0.207 | 0.069 | 0.096 | 0.083 | 0.057 | 0.076 | 0.097 | 0.102 | 0.027 | 1.000 | 1.000 | 0.609 | 0.665 | 0.220 | 0.175 | 0.120 | 0.059 |
| Incident Category | 0.067 | 0.053 | 0.053 | 0.084 | 0.071 | 0.824 | 0.086 | 0.117 | 0.087 | 0.087 | 0.092 | 0.116 | 0.126 | 0.032 | 0.701 | 0.609 | 1.000 | 0.871 | 0.440 | 0.177 | 0.076 | 0.145 |
| Incident Subcategory | 0.076 | 0.061 | 0.061 | 0.114 | 0.077 | 0.824 | 0.107 | 0.131 | 0.107 | 0.095 | 0.105 | 0.131 | 0.135 | 0.034 | 0.753 | 0.665 | 0.871 | 1.000 | 0.416 | 0.184 | 0.083 | 0.149 |
| Resolution | 0.046 | 0.049 | 0.049 | 0.046 | 0.151 | 0.243 | 0.044 | 0.093 | 0.060 | 0.084 | 0.080 | 0.092 | 0.101 | 0.021 | 0.137 | 0.220 | 0.440 | 0.416 | 1.000 | 0.108 | 0.124 | 0.057 |
| Police District | 0.030 | 0.027 | 0.027 | 0.026 | 0.023 | 0.092 | 0.490 | 0.659 | 0.472 | 0.472 | 0.601 | 0.669 | 0.942 | 0.015 | 0.201 | 0.175 | 0.177 | 0.184 | 0.108 | 1.000 | 0.799 | 0.655 |
| Analysis Neighborhood | 0.042 | 0.037 | 0.037 | 0.033 | 0.030 | 0.092 | 0.653 | 0.888 | 0.762 | 0.674 | 0.755 | 0.887 | 0.865 | 0.018 | 0.105 | 0.120 | 0.076 | 0.083 | 0.124 | 0.799 | 1.000 | 0.862 |
| HSOC Zones as of 2018-06-05 | 0.029 | 0.033 | 0.033 | 0.013 | 0.018 | 0.087 | 0.163 | 0.817 | 0.603 | 0.739 | 0.763 | 0.817 | 0.629 | 0.019 | 0.045 | 0.059 | 0.145 | 0.149 | 0.057 | 0.655 | 0.862 | 1.000 |
| Incident Datetime | Incident Date | Incident Time | Incident Year | Incident Day of Week | Report Datetime | Row ID | Incident ID | Incident Number | CAD Number | Report Type Code | Report Type Description | Filed Online | Incident Code | Incident Category | Incident Subcategory | Incident Description | Resolution | Intersection | CNN | Police District | Analysis Neighborhood | Supervisor District | Latitude | Longitude | Point | Neighborhoods | ESNCAG - Boundary File | Central Market/Tenderloin Boundary Polygon - Updated | Civic Center Harm Reduction Project Boundary | HSOC Zones as of 2018-06-05 | Invest In Neighborhoods (IIN) Areas | Current Supervisor Districts | Current Police Districts | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 25-07-2021 00:00 | 25-07-2021 | 00:00 | 2021 | Sunday | 25-07-2021 13:41 | 1.057190e+11 | 1057189 | 216105573 | NaN | II | Coplogic Initial | True | 6372 | Larceny Theft | Larceny Theft - Other | Theft, Other Property, $50-$200 | Open or Active | NaN | NaN | Southern | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | 28-06-2022 23:58 | 28-06-2022 | 23:58 | 2022 | Tuesday | 28-06-2022 23:58 | 1.165540e+11 | 1165543 | 220264913 | NaN | VS | Vehicle Supplement | NaN | 71012 | Other Offenses | Other Offenses | License Plate, Recovered | Open or Active | NaN | NaN | Out of SF | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2 | 11-03-2022 10:30 | 11-03-2022 | 10:30 | 2022 | Friday | 11-03-2022 20:03 | 1.130480e+11 | 1130480 | 226040232 | NaN | II | Coplogic Initial | True | 71000 | Lost Property | Lost Property | Lost Property | Open or Active | NaN | NaN | Central | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3 | 15-05-2021 17:47 | 15-05-2021 | 17:47 | 2021 | Saturday | 15-05-2021 17:47 | 1.030520e+11 | 1030518 | 210183345 | NaN | VS | Vehicle Supplement | NaN | 7043 | Recovered Vehicle | Recovered Vehicle | Vehicle, Recovered, Motorcycle | Open or Active | NaN | NaN | Out of SF | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | 28-06-2022 17:22 | 28-06-2022 | 17:22 | 2022 | Tuesday | 28-06-2022 17:22 | 1.165350e+11 | 1165351 | 220361741 | NaN | VS | Vehicle Supplement | NaN | 7041 | Recovered Vehicle | Recovered Vehicle | Vehicle, Recovered, Auto | Open or Active | NaN | NaN | Out of SF | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 5 | 18-11-2021 13:30 | 18-11-2021 | 13:30 | 2021 | Thursday | 18-11-2021 16:24 | 1.094350e+11 | 1094352 | 216178097 | NaN | II | Coplogic Initial | True | 28150 | Malicious Mischief | Vandalism | Malicious Mischief, Vandalism to Property | Open or Active | NaN | NaN | Mission | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6 | 28-06-2022 14:00 | 28-06-2022 | 14:00 | 2022 | Tuesday | 28-06-2022 15:09 | 1.165460e+11 | 1165462 | 226109026 | NaN | II | Coplogic Initial | True | 6244 | Larceny Theft | Larceny - From Vehicle | Theft, From Locked Vehicle, >$950 | Open or Active | NaN | NaN | Richmond | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | 07-08-2022 22:00 | 07-08-2022 | 22:00 | 2022 | Sunday | 16-08-2022 15:26 | 1.182770e+11 | 1182772 | 226147327 | NaN | II | Coplogic Initial | True | 6374 | Larceny Theft | Larceny Theft - Other | Theft, Other Property, >$950 | Open or Active | NaN | NaN | Richmond | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 8 | 04-05-2022 09:38 | 04-05-2022 | 09:38 | 2022 | Wednesday | 04-05-2022 09:39 | 1.147130e+11 | 1147129 | 210618041 | NaN | VS | Vehicle Supplement | NaN | 7041 | Recovered Vehicle | Recovered Vehicle | Vehicle, Recovered, Auto | Open or Active | NaN | NaN | Out of SF | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9 | 11-05-2018 17:30 | 11-05-2018 | 17:30 | 2018 | Friday | 13-05-2018 13:50 | 6.691847e+10 | 669184 | 186110767 | NaN | II | Coplogic Initial | True | 71000 | Lost Property | Lost Property | Lost Property | Open or Active | NaN | NaN | Out of SF | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| Incident Datetime | Incident Date | Incident Time | Incident Year | Incident Day of Week | Report Datetime | Row ID | Incident ID | Incident Number | CAD Number | Report Type Code | Report Type Description | Filed Online | Incident Code | Incident Category | Incident Subcategory | Incident Description | Resolution | Intersection | CNN | Police District | Analysis Neighborhood | Supervisor District | Latitude | Longitude | Point | Neighborhoods | ESNCAG - Boundary File | Central Market/Tenderloin Boundary Polygon - Updated | Civic Center Harm Reduction Project Boundary | HSOC Zones as of 2018-06-05 | Invest In Neighborhoods (IIN) Areas | Current Supervisor Districts | Current Police Districts | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 610885 | 01-08-2020 14:51 | 01-08-2020 | 14:51 | 2020 | Saturday | 01-08-2020 14:52 | 9.490955e+10 | 949095 | 200460567 | 202141636.0 | II | Initial | NaN | 51040 | Non-Criminal | Non-Criminal | Aided Case | Open or Active | 20TH AVE \ TARAVAL ST | 23195000.0 | Taraval | Sunset/Parkside | 4.0 | 37.743003 | -122.476765 | POINT (-122.47676460087209 37.74300263165964) | 40.0 | NaN | NaN | NaN | NaN | NaN | 7.0 | 10.0 |
| 610886 | 13-08-2020 15:00 | 13-08-2020 | 15:00 | 2020 | Thursday | 14-08-2020 09:55 | 9.527101e+10 | 952710 | 200487668 | 202271002.0 | VI | Vehicle Initial | NaN | 7023 | Motor Vehicle Theft | Motor Vehicle Theft | Vehicle, Stolen, Motorcycle | Open or Active | MANGELS AVE \ BURNSIDE AVE | 21978000.0 | Ingleside | West of Twin Peaks | 8.0 | 37.733087 | -122.438816 | POINT (-122.43881594927997 37.733086632346776) | 95.0 | NaN | NaN | NaN | NaN | NaN | 5.0 | 9.0 |
| 610887 | 26-09-2020 15:14 | 26-09-2020 | 15:14 | 2020 | Saturday | 26-09-2020 15:16 | 9.646401e+10 | 964640 | 200580234 | 202702112.0 | II | Initial | NaN | 6153 | Larceny Theft | Larceny Theft - Other | Theft, From Person, $200-$950 (other than Pickpocket) | Open or Active | BEALE ST \ MISSION ST | 24554000.0 | Central | Financial District/South Beach | 6.0 | 37.791153 | -122.395813 | POINT (-122.39581342280272 37.791152807557935) | 108.0 | NaN | NaN | NaN | NaN | NaN | 10.0 | 1.0 |
| 610888 | 10-06-2020 16:00 | 10-06-2020 | 16:00 | 2020 | Wednesday | 10-06-2020 16:00 | 9.343903e+10 | 934390 | 200328307 | NaN | IS | Initial Supplement | NaN | 28150 | Malicious Mischief | Vandalism | Malicious Mischief, Vandalism to Property | Open or Active | TEHAMA ST \ GALLAGHER LN | 28147000.0 | Southern | South of Market | 6.0 | 37.781929 | -122.403328 | POINT (-122.40332753664748 37.78192890777912) | 32.0 | NaN | NaN | NaN | NaN | NaN | 10.0 | 1.0 |
| 610889 | 20-10-2020 10:40 | 20-10-2020 | 10:40 | 2020 | Tuesday | 20-10-2020 10:41 | 9.733497e+10 | 973349 | 200620040 | 200620040.0 | IS | Initial Supplement | NaN | 71013 | Larceny Theft | Theft From Vehicle | License Plate, Stolen | Open or Active | 04TH ST \ LONG BRIDGE ST | 34168000.0 | Out of SF | Mission Bay | 6.0 | 37.773467 | -122.391434 | POINT (-122.39143433652146 37.773466920607476) | 34.0 | NaN | NaN | NaN | NaN | NaN | 10.0 | 1.0 |
| 610890 | 04-12-2020 00:00 | 04-12-2020 | 00:00 | 2020 | Friday | 05-12-2020 10:44 | 9.843021e+10 | 984302 | 200733182 | 203400924.0 | VI | Vehicle Initial | NaN | 7021 | Motor Vehicle Theft | Motor Vehicle Theft | Vehicle, Stolen, Auto | Open or Active | BAY SHORE BLVD \ COSGROVE ST | 33284000.0 | Bayview | Bayview Hunters Point | 9.0 | 37.742392 | -122.405838 | POINT (-122.40583815976386 37.74239176061754) | 82.0 | NaN | NaN | NaN | NaN | NaN | 2.0 | 2.0 |
| 610891 | 20-09-2020 00:21 | 20-09-2020 | 00:21 | 2020 | Sunday | 20-09-2020 00:21 | 9.628556e+10 | 962855 | 200566159 | 202640065.0 | II | Initial | NaN | 64085 | Other Miscellaneous | Other | Investigative Detention | Cite or Arrest Adult | ELLIS ST \ LARKIN ST | 25149000.0 | Tenderloin | Tenderloin | 6.0 | 37.784236 | -122.417707 | POINT (-122.4177067508564 37.78423573864025) | 20.0 | NaN | 1.0 | NaN | NaN | NaN | 10.0 | 5.0 |
| 610892 | 17-09-2020 06:15 | 17-09-2020 | 06:15 | 2020 | Thursday | 17-09-2020 08:51 | 9.623941e+10 | 962394 | 206135336 | NaN | II | Coplogic Initial | True | 6374 | Larceny Theft | Larceny Theft - Other | Theft, Other Property, >$950 | Open or Active | 46TH AVE \ IRVING ST | 27949000.0 | Taraval | Sunset/Parkside | 4.0 | 37.762285 | -122.506059 | POINT (-122.50605907625517 37.76228499654453) | 39.0 | NaN | NaN | NaN | NaN | NaN | 7.0 | 10.0 |
| 610893 | 08-08-2020 01:00 | 08-08-2020 | 01:00 | 2020 | Saturday | 18-08-2020 16:53 | 9.544413e+10 | 954441 | 206123773 | NaN | II | Coplogic Initial | True | 28150 | Malicious Mischief | Vandalism | Malicious Mischief, Vandalism to Property | Open or Active | 02ND ST \ NATOMA ST | 24543000.0 | Southern | Financial District/South Beach | 6.0 | 37.787203 | -122.398790 | POINT (-122.39878960122489 37.787203462687714) | 32.0 | NaN | NaN | NaN | NaN | NaN | 10.0 | 1.0 |
| 610894 | 17-01-2021 15:00 | 17-01-2021 | 15:00 | 2021 | Sunday | 17-01-2021 15:00 | 9.969712e+10 | 996971 | 210038126 | 210171798.0 | II | Initial | NaN | 19057 | Disorderly Conduct | Intimidation | Terrorist Threats | Open or Active | 04TH ST \ LONG BRIDGE ST | 34168000.0 | Southern | Mission Bay | 6.0 | 37.773467 | -122.391434 | POINT (-122.39143433652146 37.773466920607476) | 34.0 | NaN | NaN | NaN | NaN | NaN | 10.0 | 1.0 |